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DECLARATION UNDER 37 C.F.R. SI, 132 

I, Theresa White hereby declare as follows: 

1) That I obtained my Ph.D. Biological in Chemistry, from Massachusetts Institute of 
Technology, Feb. 1991, and that I have over 15 years of experience in conducting and 
directing research in the areas of Biochemistry, Cell and Molecular biology, and 
Microbiology including Functional high-level expression of proteins in fungal, bacterial, 
yeast, and plant systems, Biochemical and structural studies on hydrolase and redox 
enzymes, Optimization of enzyme properties by directed and random mutagenesis, 
Optimization of fungal strains and fermentation processes for enzyme production. 

2) That I am a Senior Research Scientist, Iogen Corporation - Ottawa, Ontario, Canada 
from March 1 994 - present. Prior to this position I was a Post-Doctoral Research Fellow, 



Loeb Medical Research Institute - Ottawa, Ontario, Canada,from October 1 993 - March 
1994, and a Post-doctoral Fellow and Research Scientist, Agriculture Canada, Ottawa, 
Ontario, Canada from December 1 990 - October 1 993. 

3) That I have reviewed and understand the specification and the prosecution history of 
the 09/990,874 application. 

4) That this Declaration is submitted in order to clarify for the record what one. of ordinary 
skill in the artwould understood from the prior art at the time the present invention was 
made. 

5) That glycosyl hydrolases are enzymes that catalyze hydrolysis of glycosidic linkages 
between covalently linked sugar moieties. Traditionally, enzymes, are classified by the 
Enzyme Commission biased upon the chemistry they catalyze (hydrolysis, oxidation, 
ammonium transfer, etc). In the case of glycosyl hydrolases, much effort lias gone into die 
development of a systematic nomenclature that identifies the enzymes by their substrate 
preference(s) and by their catalytic mechanism and structural features. Starting in 1991, 
work conducted primarily by Bernard Henrissat showed that glycosyl hydrolase enzymes 
can be classified into Families in which members not only share a common three- 
dimensional structure, but also regions of conserved amino acid homology and common 
catalytic mechanisms (Davies and Henrissat, 1995, Structure 3: 853-859).. Xylanases 
(EC3.2.1.8) were among the first group of glycosyl hydrolases classified based on this 
system and are found almost exclusively in Family 10 and Family 11. At present, there are 
over 70 Glycosyl Hydrolase families and this system of nomenclature has been adopted 
internationally for use in publications and sequence databases. As a result, one can know 
the three-dimensional structure and mechanism of a protein simply by knowing the 
Glycosyl Hydrolase family to which it belongs. 

6) That proteins are classified as Family 11 xylanases if (a) they exhibit the ability to 
hydrolyze internal beta-1,4 glycosidic bonds between adjacent xylose residues in the main 
chain of the xylan polymer with retention of configuration at the anomeric carbon and (b) 



they exhibit the primary and secondary structural signatures associated with Family 1 1 
xylanase. Prior to the filing date of the application referred to in paragraph 3 above, 
dozens of amino acid sequences for Family 1 1 xylanases had been determined (including 
those in Figure 1 in the current application). In addition, about a dozen three-dimensional 
structures of Family 1 1 xylanases had been determined by NMR and x-ray crystallography 
and. their structures published and co-ordinates deposited in the protein data bank (see 
attached Figure 1). At the present time, there are numerous Family 11 xylanase structures 
in the Protein Data Bank. Attached Figure 1 shows the 3-dimensional structures of 5 
fungal and three bacterial Family 11 xylanases from PDB files. 

7) That the high conservation of sequence and structural homology that defines Family 1 1 
xylanases continues to be supported in the literature. In 2002, Sapag et al. (2002, Journal 
of Biotechnology 95: 109-131) conducted amino acid sequence alignments of 82 Family 
11 xylanases (most of which were available prior to die filing date of the application 
referred to in paragraph 3 above) ranging in lengdi from 1 73 to 220 amino acids arid 
identified highly conserved signature sequences in beta strands B5, B6 arid B8 as well as in 
the alpha helix. 

8) That in 2003, Hakulinen et al. (2003, Eur. J. Biochem. 270: 1399-1412) conducted 
pairwise structural comparisons of ten Family 11 xylanases exhibiting 31-97% identity in 
amino acid sequence using molecular co-ordinates of the main chain C-alpha atoms and 
showed that the root-mean-square deviation (rmsd) ranged from 0.6 to 1.4 A for a given 
pair of structures. This level of deviation is within the typical resolution of most X-ray 
crystal structures indicating that natural deviation of three-dimensional structure of Family 
1 1 xylanases is very small. 

9) That all Family 1 1 xylanases contain two conserved glutamate residues at positions 86 
and 177 (see Figure 1 of the application referred to in paragraph 3 above; based on 
Trichoderina reesei xylanase II (TrX II, or Tr2) amino acid numbering), which are located 
on beta-strands B4 and B5 (Torronen, et al., 1994, EMBO J. 13: 2493-2501). 



10) That assays that specifically detect xylanase activity, such as the enzyme-dependent 
release of reducing sugars from xylan substrates as is disclosed in the present application, 
were widely available at the time of filing of the application referred to in paragraph 3 
above. 

11) That the functional expression of several Family 11 xylanases from bacterial and 
fungal sources had been demonstrated in E coli and Saccharomyces cerevisiae (for 
example, Yang et al., 1989, Appl. Environ. Microbiol. 55: 1192-1195; LaGrange et al. 9 
1996, Appl. Environ. Microbiol. 62: 1036-1044). 

12) That one of ordinary skill in the art at the time of the filing of the application referred 
in paragraph 3 above, could develop a thermophilic, thermostable or alkalophilic Family 
1 1 xylanase by 

• using degenerate primers for the highly conserved signature sequences of 
Family 1 1 xylanases to clone the gene from a microbial source and functionally 
express the enzyme in an expression host such as E. coli] 

• determine the sequence of the cloned Family 11 xylanase and conduct a 
BLAST alignment against other known Family 11 xylanase sequences to 
identify regions of high and low homology; 

• use standard molecular biology methods to generate site-specific amino acid 
substitutions of replacement of several codons with homologous codons from 
other Family 1 1 xylanase genes; and 

• express the modified xylanases from E. coli and assess their thermostability, 
thermophilicity and/or alkalophilicity using known xylanase assays. 

13) That the thermostability, thermophilicity or alkalophilicity of a Family 1 1 xylanase 
can also be achieved by Directed Evolution, first developed in the late 1990's. Directed 
Evolution requires (a) functional expression of the xylanase gene in a host microbe that can 
be transformed with high efficiency, such as the E. coli expression systems for xylanases 



presented above; (b) a method for randomly mutating the Family 1 1 xylanase gene with a 
controlled mutation frequency, such as by error-prone PCR or DNA "shuffling"; and (c) a 
highly selective high throughput screening assay that can identify active variants with 
targeted, properties. 

14) That the techniques required, for Directed Evolution were available prior to the filing 
date as demonstrated by Chen et ah (2001, Can. J. Micro. 47: 1088-1094), who used 
Directed Evolution to produce an alkalophilic variant of the Fiamily 11 xylanase from 
NeocaUimastic patriciarum. In this study, xylan-containing agar was overlaid on top of E. 
coli colonies expressing variants of the N. patriciarum xylanase to detect those clones 
expressing active xylanases. Thus, one of skill can readily identify xylanase variants that 
retain activity from a large population of variants using a simple agar plate assay. Only 
those clones expressing active xylanases are picked and grown in liquid culture to produce 
xylanase variants that will be tested for improved themiostability* alkalophilicity or 
tiiermophilicity. 

15) That one of ordinary skill in the art could develop a thermophilic, thermostable or 
alkalophilic Family 1 1 xylanase by 

• using degenerate primers for the highly conserved signature sequences of 
Family 11 xylanases to clone the gene from a microbial source and functionally 
express the enzyme in an expression host such as E: coli; 

• generate random mutagenesis libraries by error-prone PCR and clone the 
library of variants with high efficiency into an expression' host such as E. coli; 

• overlay the E. coli colonies with xylan-containing agar, incubate the plate and 
identify those colonies producing clearing zones, indicative of xylanase 
activity; 

• pick and grow the clones producing active xylanase in liquid culture; freeze a 
portion of the culture for DNA isolation; 



• prepare a cell lysate to release the xylanase variant produced by each culture 
and then conduct a standard xylanase assay at elevated temperature 
(thermophilicity), elevated pH (alkalophilicity) or under optimal pH and 
temperature after a brief exposure to elevate temperature (thermostability); and 

• isolate the DNA from the save portion of the liquid cultures for those clones 
expressing improved variants, perform DNA sequencing and identify the; 
favorable mutation. 

16) That there is riot only sufficient information concerning the conservation of three- 
dimensional structures and sequences among Family 1 1 xylanases, but there is also a large 
body of information concerning tiieir functional expression in common cloning hosts (e.g., 
E. coli) and a variety of assay formats for detecting their activity. 

17) That, determining whether or not a given mutation is compatible with the mutations at 
positions 116, 118, 144 and 161, as presently claimed in the application referred to in 
paragrach 3 above, can be determined with ease by one of skill in the art by measuring the 
thennophilicity, alkalophilicity and/or thermostability after introducing such mutation(s). 

1 8) That I hereby declare that all statements made herein of my own knowledge are true 
and that all statements made on information and belief are believed to be true; and further 
that these statements were made with the knowledge that willful false statements and the 
like so made are punishable by fine and imprisonment, or both, Under Section 1001 of 
Title 1 8 of the United States Code and that such willful false statements may jeopardize the 
validity of the application or any patent issued thereon 

EXECUTED at Ottawa, Canada this 20 day of June, 2007. 

B y 'TVi, fV jU^Tx 
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Figure 1 (attached) : 



Each Family 1 1 xylanase three-dimensional structure is identified with the PDB number 
(The Protein Data Bank: http://www.rcsb.org/pdb/home/home.doy Catalytic residues are 
shown in dark gray stick and identified using the numbering from Figure 1 in the present 
specification. 



Trichoderma reesei XYN n (PDB 1ENX) Trichoderma reesei XYN I (PDB 1XYN) 




Aspergillus kawachii XynC (PDB 1UKR) Thermomyces lanuginosus Xyn (PDB YNA) 




Bacillus circulans (PDB 1XNB) Bacillus subtilis (PDB 1IGO) 
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GIDEO N PAV1ES AND BERNARD HENR1SSAT ^' NIR1 

Structures and mechanisms of glycosyl hydrolases 

The wealth of information provided by the recent structure determinations 
of many different glycosyl hydrolases shows that the substrate specificity 
and the mode of action of these enzymes are governed by exquisite details 
of their three-dimensional structures rather than by their global fold. 



Structure 15 September 199.5, 3:85,3-859 



Carbohydrates show wide stereochemical variarion and 
can be assembled in so many different fashions, chat there 
are over 10 12 possible isomers for a reducing hexasaccha- 
ride [1]. Living organisms take advantage of this diversity 
by using oligosaccharides and polysaccharides for a 
multitude of biological functions, 6om storage and struc- 
ture to highly specific signalling roles. Selective hydrolysis 
of glycosidic. bonds is therefore crucial for energy uptake, 
cell wall expansion and degradation, and turnover of sig- 
nalling molecules. As a consequence of saccharide diver- 
sity, there is great variety amongst the enzymes that 
hydrolyze glycosidic bonds, the O-glycosyl hydrolases 
(EC 3.2,1.x). Heritable deficiencies in glycosyl hydro- 
lases, for example lactose intolerance [2] and a number of 
lysosomal storage diseases [3]. are ^ among the most 
frequent genetically based syndromes in man. 

Mechanisms 

Enzymatic hydrolysis of the glycosidic bond takes place 
via general acid catalysis that requires two critical residues: 
a proton donor and a nucleophile/base [4,5] (Fig. 1). This 
hydrolysis occurs via two major mechanisms giving rise to 
either an overall retention, or an inversion, of anomeric 
configuration [4]. In both the retaining and the inverting 
mechanisms, the position of the proton donor is identical, 
in other words it is within hydrogen-bonding distance of 
the glycosidic oxygen. In retaining enzymes, the nude o- 
philic catalytic base is in close vicinity of the sugar 
anomeric carbon. This base, however, is more distant in 
inverting enzymes which must accommodate a water 
molecule between the base and the sugar. This difference 
results in an, average distance between the two catalytic 
residues of -5.5 A in retaining enzymes, as opposed to 
-10 A in inverting enzymes [6]. 

Lysozymes were the first glycosyl hydrolases to have their 
three-dimensional (3D) structures solved [7,8]. The two 
catalytic amino acids were identified as asparcate and 
glutamate residues. In most glycosyl hydrolases studied 
since, only aspartate and/or glutamate residues have been 
found to perform catalysis. Recent data, however, suggest 
that other residues may sometimes be involved in 
glycosidic bond cleavage. Typical examples are viral 
neuraminidase and bacterial sialidase, where the 
transition state is thought to be stabilized with the help of 
a tyrosine [9,10], 
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Fie 1 The two major mechanisms of enzymatic glycosidicbond 
hydrolysis as first proposed by Koshland |4). (a) The retaining 
mechanism, in which the glycosidic oxygen is protonated by the 
acid catalyst (AH) and nucleophilic assistance to aglycon depar- 
ture is provided by the base B". The resulting glycosyl enzyme, is 
hydrblyzed by a water molecule and this second nudeopnthc 
substitution al the anomeric carbon generates a product with the 
same stereochemistry as the substrate, (b) The inverting mecha- 
nism, In which pronation of the glycosidic oxygen and aglycon 
departure are accompanied by a concomitant attack of a water 
molecule that is activated* by the base residue (B"). This- single 
nucleophilic substitution yields a product with opposite stereo- 
chemistry to the substrate. 

Enzymatic hydrolysis of certain oligosaccharides may also 
take advantage of the natural chemistry of the substrate. 
Participation of the substrate C2 acetamido group in 
catalysis by lysozymes and some retaining, chinnases has 
been suggested from several undented sources. Hydro- 
lysis by hen egg white lysozyme (HEWL) of N-acetyl- 
chitobioside substrates with either a C2 hydroxy! _oi : C2 
'acetamido group has been studied in detail [11J. 
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Removal of chc C2 acetamido group leads to a reduction 
in k C3J of at lease 100-fold, a factor which is similar, if not 
greater, than the effect of mutation of the proposed cat- 
alytic base Asp52. Perhaps the most serious evidence for 
the role of the acetamido substituent at C2, both in the 
enhancement of the catalytic rate and the retention of 
the anomeric configuration during hydrolysis, comes 
from solution studies on /\ L aceeylglucosamine-containing 
saccharides [12,13]. Hydrolysis of these sugars, in solu- 
tion, not only goes faster than that of C2 hydrox7l-sub- 
stituced sugars, but with overall retention of anomeric 
configuration, suggesting that, 'on enzyme', a catalytic 
base, such as Asp52 in HEWL, is not necessarily 
required. These observations may also help to, explain the 
apparent anomaly that some enzymes known -to catalyze 
the cleavage of pi, 4 bonds adjacent to C2 acetamido- 
substituted saccharides, whose structures have recendy 
been determined, appear to lack a suitable catalytic base 
located in an appropriate position. Soluble lytic trans- 
glycosylase [14], goose lysozyme [15], hevamine [16], 
and others fall into this category. 

Glycosyl hydrolases have developed various ways to lower 
the energy barrier of the hydrolysis reaction, such as 
substrate distortion into a sofa or half-chair conformation 
[17,18]. It is believed that protonation of the glycosidic 
bond is accompanied by a substantial lengthening of this 
bond [19]. Oligosaccharide complexes of the endo- 
glucanase V from Humicola insotens suggest that this 
enzyme has evolved a 'stretched* subsite at the catalysis 



point in order to favour binding of this elongated 
transition state over that of the ground state substrate 
molecule (G Davies, S Tolley. B Henrissat, C Hjort and 
M Schiilein, unpublished data). 

Families and folds 

A classification of glycosyl hydrolases into more than 45 
families, on the basis of similarities in amino, acid 
sequence, has recendy been proposed [20,21], with the 
prospect that this' may facilitate the derivation of useful 
information on the structure and function of these 
enzymes. Underlying this classification was the idea that 
proteins in a given family would have a fold sufficiently 
similar to allow homology modelling. In this classifica- 
tion, enzymes with different substrate specificities are 
sometimes found in the same family, indicating an evolu- 
tionary divergence to acquire new specificities, as is 
found in, for example, families 1, 13 and 16. On the 
other hand, enzymes that hydrolyze the same substrate 
are sometimes found in different families. For example, 
cellulases are found in 11 families. In other words, the 
sequence, and hence structural, classification differs 
significantly from that of the International Union of 
Biochemistry (IUB) nomenclature of enzymes, which is 
based mostly on substrate specificity. 

Because the 3D structures of proteins are more highly 
coriserved than their sequences, several sequence-based 
families may have related folds. For instance, a structural 
similarity was suggested for family 11 xylanases and 



Table 1. Structures and mechanisms in various families of glycosyl hydrolases. 



Family 


Enzyme 


1 


0-glucosidase 


2 


p-ga!actosida$e 


5 


endoglucanase A 


6 


cellobiohydrolase fl 




endoglucanase 


7 


cellobiohydrolase 1 




endoglucanase 1 


9 


endoglucanase D 


10 


xylanase A 


11 


xylanase 


13 


a-amylase 


14 


P-amylase 


15 


glucoamylase 


16 


P-1,3-l,4-glucanase 


17 


(3-1,3-1,4-glucanase 


18 


chitinase 


19 


chitinase 


20 


chitobiase 


22 


lysozyme 


23 


lysozyme 


24 


lysozyme 


33 


sialidase 


34 


neuraminidase 


45 


endoglucanase V 



Organism 



EC number 


PDB' code 


Mechanism 


Reference 


3.2.1.21 




retaining 


[311 


3.2.1.23 


1 BCL 


retaining 


1321 


3.2.1 .4 




retaining 


1331 


3;2.1.91 


3CBH 


inverting 


(281 


3.2.1.4 


1TML 


inverting 


(341 


3.2.1.91 


1CEL 


retaining 


[35J 


3.2.1.4 




retaining 


[Davies, unpublished] 


3.2.1.4 




inverting 


[36] 


3.2.1.8 


1XAS 


retaining 


[37J 


3.2.1.8 


1BCX 


retaining 


[381 


3.2.1.1 


6TAA 


retaining 


I39J 


3.2.1.2 


TBTC 


inverting 


[401 


3.2.1.3 


3CLY 


inverting 


[41] 


3.2.1.73 


1BYH 


retaining 


142] 


3.2.1.73 


1CHR 


unknown* 


[43] 


3.2.1.14 


1CTN 


retaining 


[44] 


3.2.1.14 


1 BAA 


inverting 


[45] 


3.2.1.52 




retaining 




3.2.1.17 


1HEL 


retaining 


[7] 


3.2.1.17 


153L 


unknown 


115] 


3.2.1.17 


1LYD 


unknown 


[46] 


3.2.1.18 


2SIL 


retaining 


[10] 


3.2.1.18 


1NSB 


retaining 


[47] 


3.2.1.4 


1ENG 


inverting 


[48] 



Trifolium repens 
Escherichia coli 
Clostridium cellulolyticum 
Trichoderma reesei 
Thermononospora fusca 
Trichoderma reesei 
Humicola tnsolens 
Clostridium thermocellum 
Streptomyces lividans 
Bacillus circulans 
Aspergillus oryzae 
Glycine max 
Aspergillus awamori 
Bacillus sp. 
Hordeum vulgare 
Serratia marcescens 
Hordeum vulgare 
Serratia marcescens 
Hen egg white 
Goose 

Bacteriophage T4 
Salmonella typhtmurium 
Influenza virus B 
Humicola tnsolens 



•Prolein Data Bank. 1 This family Is predicted to have a retaining mechanism [24,25). *J Tews, Z Dauter, KS Wilson & CE Vorgias, 
[abstract 038], 4th European Workshop on Crystallography of Biological Macromolecules, Como, Italy, May 1 995. 
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r milv 12 cellulases [22], whereas family 7 cellulases have 
Sfoundtc, have J arrangement of catalytic rescues 
SSPi feldspar to those of the B-1.3-glucanases and 
B 1 3-1 4-glucanases of family 16 [23]. More recendy 
ftmilies'l, 2, 5, 10, 17, 30, 35,39 and 42 were : proposed 
to have evolved from a common ancestor [24,2a]. All ot 
these family groupings substantiate the strict conservation 
of the catalytic machinery and mechanism during evolu- 
tion. A grouping indicating structural similarity -for vari- 
ous lysoVnes (families 22, 23 and 24 ( and family 19 
plant chitinases has also been proposed [26]. This group- 
ie however, includes enzymes known to operate with 
Mention of configuration, such as HEWL, and some 
with a substantially different arrangement of catah/nc 
amino acids, such as family 19 plant chitinases, which are 
inverting enzymes [27]. 

Table 1 reports the various glycosyl hydrolases families 
for which at least one 3D structure has been determined, 
together with the mechanism of glycosidic bond hydro- 
lysis in cases where it is known. The substrates for these 
enzymes are shown in Figure 2. Many glycosyl hydrolases 
have a modular structure consisting of a catalyuc domain 
and one or more non-catalytic domains, some of which 
are involved in substrate binding, but most of which have 
unknown functions. Figure 3 shows the main folds found 



in the catalytic domains of selected glycosyl hydrolases 
and, for families 6 and 7, the comparison of the struc- 
tures of cellobibhydrolases with those of the correspond- 
ing endoglucanases (see below). 

Active-site topologies . . • l 

Although many protein folds are represented, in the 
2-> famines for which a 3D structure is known, the 
overall topologies of the active sites fall into only three 
general classes, regardless of whether the enryme is 
inverting or retaining. These three, topologies (Fig. 4) 
can. in principle, be built on the same fold, with the 
same catalytic residues. 

Pocket or crater . . 

This topology (Fig. 4a) is optimal for the recogninon ot a 
saccharide non-reducing extremity and is encountered in 
monosaccharidases such as 0,galactosidase, (5-glucosidase. 
sialidase and neuraminidase, and in exppolysaccharidases. 
such as glucoamylase and B-amylase. Such exopolys- 
accharidases are adapted to substrates having a large num- 
ber of available chain ends, such as native starch granules, 
whose radial structure exposes all the non-reducing chain 
ends at the surface. On the other hand, these enzymes are 
not very efficient for fibrous substrates such as nanve 
cellulose, which has almost no free chain ends. 



Fig. 2. Structures of the substrates for the 
enzymes given in table 1. The hydro, 
lyzed bond is shown in red. (a) Una- 
marin (substrate for cyanogenic 
B-glucosidase; family 1). (b) Lactose 
(p-galactosidase, family 2). (c) Cellulose 
(endoglucanases and cellobiohydro- 
'lases, families 5, 6, 7, ? and 45). 
(d) Xylan (xylanases, families 10 and 
11). (e) Amylose (a-amylase, family 1 3; 
P-amytase, family 14; and glucoarriy- 
lase. family 15). (0 Mixed p-1 ,3-1,4- 
glucan with the bond hydrolyzed by 
P-1,3-glucanases (family 16) in red and 
that cleaved by p-1 ,3-1 , 4-glucanases . 
(family 17) in cyan, (g) C.nitin (chiti- 
nases in families 18 and 19, family 22 
lysozymes). (h) Chitobiose (chitobiase, 
family .20). (i) Bacterial cell wall poly, 
mer consisting of' alternating rV-aceryl- 
muramic acid and N-acetylglucosamine 
units (lysozymes In families 22, 23 and 
24). (j) Glycoconjugate-linked sialic 
acid (sialidase, family 33 and neu- 
raminidase, family 34). 
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Family 1 



Family 2 



Family 5 




Family 7 
(endoglucanase) 



Family 1 5 




Family 23 




Family9 



Family! 0 




Family 16 



Family 17 




Family 24 



Family 33 




Family 6 
(cellobiohydrolase} 

A) 




Family 11 




Family IB 



Family 34 





Family 6 Family? 
fendoglucanase) (cellobiohydrolase) 




Family 13 




Family 1 4 




Family 19 



Family 22 




Family 45 



Fig 3. Ribbon representation of the. main fold of the catalytic domain in various glycosyl hydrolase families (see Table 1 and text) 
P strands are shown in cyan and a helices in red. (Figure produced using the program MOLSCRIPT [49] ) 



Cleft or groove 

This 'open* structure (Fig. 4b) allows a random binding: 
of several sugar units in polymeric substrates and is 
commonly found in endo-acting polysaccharidases such 
as lysozymes, endocellulases, chitinases, a-amylases, 
xylanases, (3-1 ,3-1,4-glucanases and P-l,3-glucanases. 

Tunnel 

This topology (Fig. 4c) arises from the previous one 
when the protein evolves long loops that cover part of 
the cleft. Found so far only in cellobiohydrolases, the 
resulting tunnel enables a polysaccharide chain to be 
threaded through it [28]. A comparison of the cellobi6- 
hydrolases in families 6 and 7 with the corresponding 
endoglucanases is shown in Figure 3. The loops that 
cause the catalytic centres of cellobiohydrolases to lie. 
within enclosed tunnels can be seen clearly and, for the 
family 6 enzymes, are also illustrated by the surface repre- 
sentations in Figures 4b and 4c. This topology allows 
these enzymes to release the product while remaining 
firmly bound to the polysaccharide chain, thereby 
creating the conditions for processivity (Fig. 5). It 
remains unclear at present whether the substrate initially 



penetrates the active sice in an 'exd' fashion by one of the 
two entrances of the tunnel or whether the loops, that 
close the active site can 'open* occasionally to allow a 
random binding followed by the processive action. In 
either case it should be noted that, depending on the 
mechanism (inverting or retaining) and the exact position 
of the cleavage point with respect to the several subsites, 
the directionality of the enzyme motion along the chain 
may change. For 'instance, cellobiohydrolase II of 
Trichoderma reesei proceeds towards the reducing end of 
cellulose, whereas the reverse was suggested for cellobio- 
hydrolase I from the same organism [23], Processivity is 
probably a key factor for the efficient enzymatic degra- 
dation of insoluble micro crystalline cellulose. 

Concluding remarks 

Orengo et al. [29] have shown that certain protein folds 
(superfolds) occur more often than others. More 
precisely, only nine superfolds are sufficient to describe 
the folding in ~3Q% of all proteins and it is thought that 
the total number of protein folds is not more than a 
few thousand [29,30]. So far, from the 22 families of gly- 
cosyl hydrolases for which ;a 3D structure has been 
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Fig. 4, The three types of active site 
found in glycosyl hydrolases, (a) The 
pocket {glucoamylase from A avvamon). 
(b) The cleft (endoglucanase E2 from 
T. fuscal (c) The. tunnel (eel I obi o hydro- 
lase II from T. reesei). The proposed cat- 
alytic residues are shaded in red. 
(Molecular surface diagrams were pre- 
pared using, the MO LV I EWER program 
|M Hartshorn, unpublished program).) 
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determined, nine are of the TIM barrel type (Table 1). In 
^addition, detailed sequence comparisons have, suggested 
that several other families also share the same fold, imply- 
ing that, for glycosyl hydrolases, the bias towards this fold 
could be even more important [24]. . 

There are presently 52 families in the classification based 
on sequence similarities (B Henrissat and A Bairoch, 
unpublished data); 3D structures are known for about 
40% of these families. The first enzyme structure to be 
solved, more than 30 years ago, was that of hen egg white 
lysoryme, a glycosyl hydrolase. At. the present pace of 
structural investigations, the 3D fold for all of the remain- 
ing families could be determined within a few years. 
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Abstract 

Eighty-two amino acid sequences of the catalytic domains of mature endoxylanases belonging to family 1 1 have 
been aligned using the programs matchbox and clustal. The sequences range in length from 175 to 233 residues. 
The two glutamates acting as catalytic residues are conserved in all sequences. A very good correlation is found 
between the presence (at position 100) of an asparagine in the so-called 'alkaline' xylanases, or an aspartic acid in 
those with a more acidic pH optimum. Four boxes defining segments of highest similarity were detected; they 
correspond to regions of defined secondary structure: B5, B6, B8 and the carboxyl end of the alpha helix, respectively. 
Cysteine residues are not common in these sequences (0.7% of all residues), and disulfide bridges are not important 
in explaining the stability of several thermophilic xylanases. The alignment allows the classification of the enzymes in 
groups according to sequence similarity. Fungal and bacterial enzymes were found to form mostly separate clusters 
of higher similarity, © 2002 Elsevier Science B.V. All rights reserved. 

Keywords: Family 1 1 endoxylanases; Factor analysis classification; Sequence alignment; Structural relationships 
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1. Introduction 

Xylan, a heteroglycan, is the main constituent 
of plant hemicelluloses. It is composed of a linear 
chain of xylose residues linked by (3(1 ->4) glyco- 
side bonds, with a variety of substituents, de- 
pending on its source (Joseleau et al., 1992). 
Biodegradation of xylan is accomplished by a 
complex set of enzymes generically called 'xy- 
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kinases', which are produced by fungi and bacte- 
ria (Biely, 1985). 

The hydrolysis of the xylan backbone is per- 
formed mainly by the action of the endoxylanases 
(E.C. 3.2.1.8), which liberate xylooligosaccharides 
of different length (Biely, 1985). A large number 
of endoxylanases have been described, purified, 
characterized and sequenced from different mi- 
croorganisms (Sunna and Antranikian, 1997), and 
based on sequence similarities and hydrophobic 
cluster analysis, they have been grouped in 
families 10 (or F) and 11 (or G) of the glycosyl 
hydrolases (Gilkes et al., 1991; Henrissat and 
Davies, 1997). Family 10 xy kinases have a cata- 
lytic domain with molecular weights in the range 
of 32-39 K. The structure of the catalytic domain 
of several family 10 xylanases has been deter- 
mined by X-ray crystallography and it consists of 
an eightfold P/ot barrel (Harris et al., 1996). These 
enzymes show a greater catalytic versatility than 
family 11 endoxylanases (Biely et al., 1997). Fam- 
ily 1 1 xylanases, on the other hand, have a much 
smaller catalytic domain (around 20 kDa), with 
an all (3-strand sandwich fold structure (Himmel 
et al., 1997). Some enzymes of family 10 and 
family 1 1 possess, in addition, cellulose or xylan 
binding domains (Torronen and Rouvinen, 1997). 

The endoxylanases have been found to be use- 
ful for several different biotechnological applica- 
tions (Section 3.4). Therefore, for the design and 
protein engineering of endoxylanases, a good 
knowledge of their sequence and structure is of 
great importance. 

For this purpose, a comparative analysis of the 
sequences of 82 catalytic domains of family 1 1 
glycanases has been performed in this work. The 
large number of sequences available allows a fine 
analysis of sequence similarities, with the purpose 
of establishing structural relationships, determine 
location of conserved (and possibly essential) se- 
quences and establish phylogenetic relationships. 

2. Materials and methods 

Sequences were obtained from the literature 
and from the public databases (SwissProt, Gen- 
Bank and CAZy (http://afmb.cnrs-mrs.fr/ 



pedro/CAZY/ghf.html)). The 82 sequences were 
aligned by the method described below. 

To define conserved residues in a reliable align- 
ment of numerous sequences of very different 
length and of poor similarity is not trivial (de 
Fays et al., 1999). Although alignments are widely 
used, the rate of false positive functional similari- 
ties deduced from skewed aligned positions is not 
easily overcome (Briffeuil et al., 1998). In this 
study, an original methodology has been applied 
to obtain reliable alignments allowing a precise 
delineation of the corresponding relevant residues 
in the different sequences, despite the fact that 
some sequences share a very low percentage of 
identity (10%). 

The first step of this methodology is based on 
the local alignment method matchbox (De- 
piereux and Feytmans, 1992). Default parameters 
were used to delineate conserved boxes (defining 
Predicted Structural Conserved Regions, PSCR) 
with an estimated level of confidence (from 1 to 9) 
at each position of the alignment (Depiereux et 
al., 1997). In this work, PSCR's are defined for 
reliability scores below 5 on more than 2 non-re- 
dundant sequences, which corresponds to a rate 
of confidence of over 90%. This ensures less than 
10% of 'false positive' aligned positions, a remark- 
able rate at this level of low similarity. 

The second step of the procedure is based on 
the global alignment method CLUSTAL (Higgins et 
al., 1992). It is used to align the regions inserted 
between the anchor points defined above. Thus, 
the final alignment obtained by this original pro- 
cedure takes advantage of both approaches: 
matchbox for definition of the boxes (PSCR's), 
and clustal for the multiple alignment of the 
segments outside the boxes. 

The proposed classification of the family 1 1 
endoxylanases is based on a principal coordinates 
analysis computed from a similarity matrix be- 
tween the sequences (Depiereux and Feytmans, 
1991). The method takes advantage of a grouping 
of the sequences independent of any alignment 
steps and thus avoids a misclassification due to 
misalignments. Briefly, the similarity coefficient is 
based on matches, for a given threshold, between 
short unaligned segments of the sequences com- 
pared; eigenvectors and eigenvalues are computed 
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by matrix diagonalization, and sequences are plot- 
ted for factors 2 and 3, factor 1 being poorly 
discriminant. Eigenvector coordinates have been 
submitted to a cluster analysis, Ward's method, 
run on statistica 4.1 (Statsoft, lnc) and plotted 
in an excel spreadsheet. 

PSCR delineation and factor analysis have been 
performed by the programs Align and Explore on 
the matchbox server http://www.fundp.ac.be/ 
sciences/biologie/bms/matchbox_submit.shtml 
and global alignment on the Clustal server http:// 
dot.imgen.bcm.tmc.edu:9331/multi-align/Options/ 
clustalw.html. 



3. Results and discussion 

Table 1 lists the 82 enzymes used in this study. 
It includes 36 sequences from bacteria, 43 from 
fungi, 2 from protozoa and 1 from insects. 
Twenty-two bacterial and 8 fungal enzymes are 
multidomain proteins; they correspond mainly to 
anaerobic microorganisms. Only the sequence of 
the catalytic domain of the mature enzymes is 
considered in this analysis. The amino terminus of 
the mature enzymes was taken to be either the 
experimentally determined terminus, when re- 
ported in the literature, or the predicted terminus, 
determined by means of the program SIGNALP 
(Nielsen et al., 1997). The enzymes from Fi- 
brobacter succinogenes, Neocallimastix frontalis 
and Neocallimastix patriciarum have two family 
1 1 catalytic domains each; these domains are con- 
sidered separately in the analysis. The size of the 
catalytic domain of the multidomain enzymes is 
somewhat arbitrary; in Fig. 1 the sequences be- 
longing to these enzymes are marked with a star 
at the beginning and/or end to indicate this fact. 

The enzymes analyzed (not considering the 
multidomain proteins) range in molecular mass 
from 19035 (Trichoclerma reesei ) to 25908 
(Clostridium acetobutylicum), and in sequence 
length from 175 (Polypastron multivesiculatum 
Xyn A) to 233 amino acid residues (C. aceto- 
butylicum). A shorter sequence (Xyn 4 from As- 
pergillus niger; 153 residues, accession number 
U39785) was not included in this study because it 
aligned poorly to the other sequences due to its 



shorter length, although it is considered part of 
family 1 1 by CAZy. 

Table 2 shows the pi's, pH and T° optima of 
those enzymes from the above list which have 
been purified and characterized. The pi values 
given are values measured in the laboratory; no 
theoretical estimations have been included. A 
wide range of pi values has been found: from a 
low of 3.5 for Aspergillus kawachii and Aspergillus 
niclulans to a high of > 10.25 for Streptomyces 
liviclans xylanases. Bacterial enzymes show pH 
optima ranging from 5.5 to 7, while those from 
fungi show a much wider range (from pH 2.0 to 
8.0), the majority having acidic pH optima. 

The three-dimensional structure of family 1 1 
endoxylanases has been determined for several 
enzymes, from both bacteria and fungi (Table 3). 
The catalytic domain folds into two (3 sheets (A 
and B) constituted mostly by antiparallel [3 
strands and one short alpha helix and resembles a 
partially closed right hand (Torronen and Rou- 
vinen, 1997). The loop between strands B7 and B8 
forms the 'thumb' and the loop linking strands B6 
and B9 is the 'cord'. A great similarity is found in 
all these structures. Fig. 2 presents a topology 
diagram showing the sequence of the secondary 
structure elements found in these enzymes. Two 
glutamic acid residues have been found to be 
catalytically essential and are located in strands 
B4 and B6, respectively (Torronen and Rouvinen, 
1997). 

3.1. Multiple sequence alignment 

The original alignment methodology used in 
this work is aimed at delineating structural fea- 
tures shared by all the xylanase sequences in- 
cluded in the analysis. First, boxes obtained from 
the matchbox alignment program have been 
shown to accurately outline conserved structural 
motifs (De Bolle et al., 1995; Vinals et al., 1995; 
Bertrand et al., 1997, 1998; Depiereux et al., 
1997). The fact that these predicted structurally 
conserved regions actually reflect robust structural 
similarities among the endoxylanases has been 
checked by comparison of the three-dimensional 
structures of the 1 1 xylanases of known crystallo- 
graphy structure (Table 3) using the homology 
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Table I 

Family 1 1 xylanases used in this study 



Organism 


Protein 


Gene 


Accession No. 
(GenBank) 


Catalytic 


domain 


References 










Length" 


MW 




Bacteria 














Aeromonas caviae 


Xylanase I 


xynA 


D32065 


183 


20212 


Kubata et al. (1997) 


Ba c i litis agar a dhaerens 


Xy kinase 




A48223 


221 


24 681 


Sabini et al. (1999) 


Bacillus circulans 


XLNA 


xlnA 


X07723 


185 


20 382 


Yang et al. (1988) 


Bacillus pumihts 


XYNA 


xynA 


X00660 


201 


22 515 


Fukusaki et al. (1984) 


Bacillus sp. 


Xylanase Y b 


xyn Y 


S51779 


200 


22 201 


Yu et al. (1993) 


Bacillus sp. 


Xylanase S 


xynS 


X59058 


185 


20 364 


Yu et al. (1993) 


Bacillus sp. 


Xylanase 


xynA 


U51675 


185 


20 220 


Jeong et al. (1998) 


Bacillus sp. D3 


Xylanase 


xyn 




182 


20 683 


Harris et al. (1997) 


Bacillus sp. 41 M-l 


Xylanase J b 


xynJ 


AB029319 


199 


22 098 


Nakai et al. (1994) 


Bacillus 


XYNA 


xynA 


U 15985 


191 


21 083 


Cho and Choi (1995) 


stearothermophilus 














Bacillus sub til is 


Xylanase A 


xynA 


M 36648 


185 


21 451 


Pa ice et al. (1986) 


Caldicellulosiritptor sp. 


Xylanase b 


xynD 


AF036925 


199 


22 158 


Morris et al. (1999) 


Rt69B.l 














Cellulomonas jimi 


XYLD b 


xynD 


X76729 


198 


21 451 


Millward-Sadler et al. 
(1994) 


Cellvibrio mixtus 


XYLA b 


xynA 


Z48925 


207 


22 550 


Millward-Sadler et al. 
(1995) 


Clostridium 


Xylanase B 


xynB 


M31726 


233 


25 908 


Zappe et al. (1990) 


acetobutylicum 














Clostridium stercorarium 


XynA b 


xynA 


D 13325 


193 


22 130 


Sakka et al. (1993) 


Clostridium thermocellum 


Xylanase U b 


xyn U 


AF04776I 


204 


22 842 


Unpublished 


Clostridium thermocellum 


Xylanase V b 


x yn V 


AF047761 


204 


22 787 


Unpublished 


Clostridium thermocellum 


XynA b 


xynA 


AB010958 


200 


22 445 


Hayashi et al. (1999) 


Clostridium thermocellum 


XynB b 


xynB 


A BO 10958 


200 


22 365 


Hayashi et al. (1999) 


Dictyoglomus 


Xylanase B b 


xynB 


U 76545 


198 


22 204 


Morris et al. (1998) 


thermophdum 














Fibrobac ter succinogenes 


XynC (domA) b 


xynC 


U01037 


234 


25 530 


Paradis et al. (1993) 


Fibroba cter succinogen es 


XynC (domB) b 


xynC 


U01037 


216 


24 437 


Paradis et al. (1993) 


Pseudomonas fluo r esc ens 


XYLE b 


xynE 


Z48927 


202 


22 130 


Millward-Sadler et al. 
(1995) 


R i tmino coccus albus 


XynA b 


xynA 


U43089 


236 


26 314 


Unpublished 


Rum ino co ecus jla vejaciens 


XYLA b 


xynA 


Z11127 


221 


24 346 


Zhang and Flint (1992) 


Rum ino co ecus jla vejaciens 


XynB b 


xynB 


Z35226 


216 


24216 


Zhang et al. (1994) 


Ruminococeus jla vejaciens 


XYLD b 


xynD 


S61204 


213 


24 031 


Flint et al. (1993) 


Ruminococcus sp. 


Xylanase l b 


xyn 1 


Z49970 


213 


23 904 


Unpublished 


St rep ton jyecs tividans 


XlnB b 


xlnB 


M64552 


192 


21 064 


Shareck el al. (1991) 


St rep ton jyees tividans 


XlnC 


xlnC 


M64553 


191 


20 715 


Shareck et al. (1991) 


Streptomyces sp. EC 3 


Xylanase 


xln 


X81045 


191 


20 931 


Mazy-Servais et al. (1996) 


Streptomyces sp. S38 


Xylanase 


xyll 


X985518 


190 


20 585 


Georis et al. (1999) 


Streptomyces sp. 36a 


Xylanase 






192 


20 973 


Nagashima et al. (1989) 


Streptomyces 


STX-Il b 


stx-\\ 


D85897 


190 


20 738 


Tsujibo et al. (1997) 


thermoviolaceus 














Thermomonospora j'usca 


TfxA b 


xynA 


U01242 


190 


20 900 


Irwin et al. (1994) 


Fungi 














Ascochyta pisi 


Xylanase 


xyll 


Z68891 


208 


22 185 


Liibeck et al. (1997) 


Aspergillus awamori 


EX LA 


exlA 


X78115 


184 


19 876 


Messing et al. (1994) 
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Table I (Continued) 



Organism Protein Gene Accession No. Catalytic domain References 

(Gen Bank) 











Length" 


MW 




Aspergillus kawachii 


Xylanase B 


xynB 


P48824 c 


207 


22 259 


Unpublished 


Aspergillus kawachii 


XynC 


xynC 


DI4848 


184 


19 876 


Ito et al. (1992a) 


A spergil/us nidulans 


X22 


xlnA 


249892 


188 


20 235 


Perez-Gonzalez et al. (1996) 


Aspergillus nidulans 


X24 


xlnB 


249893 


188 


20 077 


Perez-Gonzalez et al. (1996) 


Aspergillus niger 


Xylanase A 


xylA 


AI9535 


184 


19 890 


Maat et al. (1992) 


Aspergillus niger 


XynNB 


xynNB 


D38071 


188 


20 069 


Kinoshita et al. (1995) 


Aspergillus niger 


Xyn5 


XYN 5 


U 39784 


195 


21 143 


Unpublished 


Aspergillus oryzae 


XynGl 


xynGl 


AB003085 


189 


20 589 


Kimura et al. (1998) 


Aspergillus tubigensis 


XYLA 


xlnA 


L26988 


184 


19 837 


de Graaff et al. (1994) 


Aspergillus tubigensis 


Xylanase B 




A39368 


207 


22 240 


Patent WO 94 14965- A (1994) 


A ureobasidium pullulans 


XynA 


xynA 


U 10298 


187 


20 074 


Li and Ljungdahl (1994) 


Chaetomiutn gracile 


CgXA 


cgxA 


D49850 


189 


20 149 


Yoshino et al. (1995) 


Chaetomium gracile 


CgXB 


cgxB 


D49851 


21 1 


22 525 


Yoshino et al. (1995) 


Claviceps purpurea 


Xylanase 


xyll 


Y 16969 


197 


21 460 


Giesbert et al. (1998) 


Cochliobolus carbonum 


Xyll 


XYLl 


L13596 


191 


20 856 


Apel et al. (1993) 


Cochliobolus carbonum 


Xyl2 


XYL2 


U58915 


191 


21 199 


Apel-Birkhold and Walton 














(1996) 


Cochliobolus carbonum 


Xy13 


XYL3 


U58916 


183 


19 858 


Apel-Birkhold and Walton 














(1996) 


Cochliobolus sativus 


Xylanase 


xvl2 


AJ004802 


212 


23 658 


Emami and Hack (2001) 


Cryptococcus sp. S-2 


Xyn-CS2 


xvn-CS2 


D63382 


184 


20 209 


lefuji et al. (1996) 


Humicola insolens 


Xyll 


xyll 


X76047 


208 


23 814 


Dalbogc and Heldt-Hansen 














(1994) 


Magnaporthe grisea 


XYN22 


xyn22 


L37529 


194 


21 427 


Wu et al. (1995) 


Neocallimastix frontalis 


Xylanase 2 


XYN 2 


S48865 


212 


23 933 


Unpublished 


Neocallimastix frontalis 


XYN3 (domA) b 


xyn3 


X82266 


223 


24 394 


Durand et al. (1996) 


Neocallimastix frontalis 


XYN3 (domB) b 


xyn3 


X 82266 


223 


24 532 


Durand et al. (1996) 


Neocallimastix patriciarum 


XYLA doml) b 


xynA 


X65526 


226 


24 941 


Gilbert et al. (1992) 


N eocallimastix patriciarum 


XYLA (dom2) b 


xynA 


X65526 


225 


24 770 


Gilbert et al. (1992) 


Orpinomyces strain PC-2 


XynA b 


xynA 


U578I9 


225 


24 810 


Li et al. (1997) 


Paecihn tyces varioti 


PVX 




P81536 c 


194 


21 365 


Kumar et al. (2000) 


P en ici Ilium sp. 40 


XynA 


xynA 


AB035540 


190 


20713 


Kimura et al. (2000) 


Pen ici I Hum pur pit rogen um 


XynB 


xynB 


250050 


183 


19 371 


Diaz et al. (1997) 


Pichia stipitis 


Xylanase A b 


xynA 


AF15I379 


232 


26 291 


Unpublished 


Piromyces sp. (inactive) 


XYLA b 


xynA 


Vfl 1 oco 




2j j jo 


1 anutti et al. ( iyy:>) 


Piromyces sp. (active) 


XYLA b 




X91 858 


222 


24 803 


P'irmf ti pt •)! ( 1 QQM 
lullUlU cl ill. \ \yyJ) 


Schizophyllum commune 


Xylanase A 


xynA 


P35809 c 


197 


20 965 


Oku et al. (1993) 


Thermomyces lanuginosus 


XynA 


xynA 


U35436 


206 


22614 


Schlacher et al. (1996) 


Trichoderma harzianum E5820 kD Xylanase 




P48793 c 


190 


20 690 


Yaguchi et al. (1992b) 


Tricl i oden na reesei 


XYNI 


xyn I 


X69574 


178 


19 035 


Torronen et al, (1992) 


Tricl i ode r ma reese i 


XYNII 


xyn2 


X69573 


190 


20 829 


Torronen et al. (1992) 


Trichoderma reesei 


Xyn2 


XYN 2 


U2419I 


190 


20 731 


La Grange et al. (1996) 


Trichoderma viride 


Xylanase 1 1 A 




A44594 


190 


20 759 


Yaguchi et al. (1992a) 


Trichodert na viride 


Xylanase I IB 




A44595 


190 


20 743 


Unpublished 


Protozoa 














Poly plastron 


XYN A 


xynA 


AJ009828 


219 


25 192 


Unpublished 


multivesiculatum 












Polyplastron 


Xylanase 


polyX 


AB01 1274 


175 


19 394 


Unpublished 


multivesiculatum 












Insect 














Phaedon cochleariae 


Xylanase 




Y 17908 


200 


22 070 


Unpublished 



11 Number of amino acid residues. 
b Multidomain protein. 
c SWISS-PROT entry. 
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f 1 


XynD 
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29 Ru 


sp 


Xyll 








. . . A D A Q Q R G 
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. YEMWNQNG 


30 St 


li 


XlnB 








OTVVTTNQEG 


T N N G Y Y . . . 


. Y S F w T D S . 


31 St 


li 


XlnC 








ATTITTNQTG 


T D . G M Y . . . 


. Y S F W T D G . 


32 St 


sp 


EC3 








ATTITTNQTG 


Y D . G M Y . . . 


. Y S F W T D G . 


33 St 


sp 


S38 








DTV1TTNQTG 


T N N G Y Y . . . 


. Y S F W T D G . 


34 St 


sp 


36a 








ATTITTNQTG 


y d . gh y . . . 


. Y S F W T D G . 


35 St 


th 


STII 








. DT1TSNQTG 


T H N G Y F . . . 


. Y S F W T D A . 


36 Th 


fu 


TfxA 








. , AVTSNETG 


Y H D G Y F . . . 


. Y S F W T D A . 


37 Aso.pi . xyl 








ARAGTPSSQG 


THNGCF. . . 


. Y S W W T D G . 


38 As 


aw 


EXLA 










. S A G I N 


Y V Q N Y N G . 


39 As 


ka 


xylB 








SERSTPSSTG 


E N N G Y Y . . . 


. Y S F W T D G . 


4 0 As 


ka 


xynC 










. S A G I N . . . 


. Y V Q N Y N G . 


41 As 


nid.X22 








. . . S T P S S T G 


W S N G Y Y . . . 


. Y S F W T D G • 


4 2 As 


nid.X24 








. . . S T P S S T G 
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Fig. 1. Multiple alignment of 82 sequences of family 1 1 endoxylanases performed by MATCHTAL. The sequences follow the same 
order given in Table 1. On top of the alignment, secondary structure elements observed in the crystal structures are provided. Boxes 
(Predicted Structural Conserved Regions) obtained by matchbox are shaded. The two catalytic Glu residues (El 69 and E289 
according to the numbering of the multiple alignment) are in boldface. Residue 100 (N or D, depending on the pH optimum, see 
text) is underlined. The putative start and/or end of the catalytic domain sequences belonging to multiple domain xylanases is 
indicated by a star. For abbreviations see Table 1. 
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Table 2 

Isoelectric point, pH and T° optimum of family 1 1 xylanases of known sequence 



Organism 


Xyla n ase 




pH optimum 


Optimum T° 


Reference 


Bacteria 












A eron t on cts co vioc 


Xylanase 1 


7.1 


7.0 


55 


Kubata et al. (1992) 


H/it'illn v ftu y/i/ilisi(>i'/*ii v 

UliL lllllO ttVI UHftltLf t-tlti 


xyl 1 1 


8.8 


n.d. 


n.d. 


ociuiiii ci ti j . \\yyy ) 


Rncilfnv nuniihtv 


XYNA 


n.d. 


6.5 


45-60 


Panbanered et al C1983) 


Bacillus <;n D3 


X vlll I1MSP 


7.7 


6.0 


75 


Harris et al H997} 


Bacillus sn 41 Ml 


Xvlnniisp 1 


5.5 


9.0 


50 


Naknmuri ft ;\\ C1992^ 

i ^ u ivu 1 1 1 li i ii 1 1 til. \ i y y ^ f 


Fi/irilltiv vnhtiliv 


XvUin^i^p A 


8.9 


n.d 


n.d. 


Puire e\ ;il ( 1 QR6^ 


filflirt'llfiifivivtifttfir ir* 


Xvlnnnsp 
y i ci 1 1 ci a t< 


n.d. 


5.5 


70 


iviuiiia ci cii. \\ jyy } 


Rt69B. 1 












Clost r idium o ce tobu tyl icum 


V 1 

Xylanase B 


O.J 


s s ft 

J. j— o 


uu 


Lee ei ai. ^ i yo / ) 


Clostridium stercorarium 


XynA 


4.5 


7.0 


75 


Sakka et al. (1994) 


Clostridium thermoccllum 


XynA 


n.d. 


6.5 


65 


Hayashi et al. (1999) 


Dictroglonuts thermophilum 


Xylanase B 


n.d 


6.5 


85 


Morris et al. (1998) 


Fibrobocter succinogenes 


XynC 


6.2 


6.5 


n.d. 


Paradis et al. (1993) 


Rwninococcus flovefociens 


XYLA 


5.0 


5.5 


50 


Flint et al. (1991) 












Garcia-Campayo et al. (1993) 


Streptomyces lividans 


XlnB 


8.4 


6.5 


55 


Kluepfel et al. (1990) 


Streptomyces lividans 


XlnC 


> 10.25 


6.0 


57 


Kluepfel et al. (1992) 


Streptomyces sp. EC3 


Xylanase 


9.1 


n.d. 


n.d. 


Mazy-Servais et al. (1996) 


Streptomyces sp. S38 


Xyll 


9.8 


6.0-6.5 


55-60 


Georis et al. (2000) 


Streptomyces 


STX-II 


8.0 


7.0 


60 


Tsujibo et al. (1992) 


thermovioioceus 












Thermomonospora fusca 


TfxA 


10 


7.0 


n.d. 


Irwin et al. (1994) 


Fungi 












rispergiititj itw ufnuf i 


FX I A 

LALr\ 


^ 7 


11. u. 


n H 
11. u. 
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j. j 




JU 


It/A pt -»1 ( 1 QQ'M 
1 lO cl dl. \ i yJL ) 


A efi/Jfo ititi i* iii/ltiifiitv 
ri jl/Cf t^IlttliS flilllllllflj 


X22 


6.4 


5.5 


62 


1 ti UiiJiuc/!. -iZibpiiidi li til. \\yyj) 


rlApet gillliCi filllltiliflA 


X24 




J.J 


JZ 


Hernandez- Espinar et al. (1996) 


risfJLt gltlltA iiigLr 


Xvl A fnr \\ 


1 7 


J.U 


n ft 
1 1 . u . 


M*i'it pt -il ( lOQ*?^ 
ivituu t,i di. \ \jyi.) 












rwrengei ana uijKSita ^lyyoj 


/t Vi)f*vit 1 1 1 1 i v nnjify 
rlAUL.t VltltiA fltl^Lf 


XvnNR 


n.d. 


5.0 


n d 


iviiiudiiiui ti til. \ \yyj) 


A sp erg illi is tubigens is 


A I Lr\ 


^ ft 


n.o. 


n.a. 


gc oiaaii et at. yxyy^) 


Aweobosuhum pullulons 


Y\;n A t A PY 1 1 ^ 
AynA (ArA \\) 


Q d 
y.H 


4.0 


j4 


li et ai. { i yyj ) 


Cochliobolus corbonum 


Ay! i 


> y. 3 




4j 


Holden and Walton (lyyi) 


Cochliobolus corbonum 


Ayi^ ana j 


> y.j 


< ft 
J.U 


« ^i 
n.o. 


Holden ana Walton (lyy^) 


wntfif'fM'cuv Qn ^-9 


Xvn-CS2 


74 


2.0 


40 


Ipfnii pt 'il i \ QQA^ 
iciliji ti tii. \iyy\j) 


Magnoporthe grisea 


XYN22 


9.7 


n.d. 


n.d. 


Wu et al. (1995) 


Poecilomyces vorio t i 


PVX 


3.9 


5.5-7.0 


65 


Krishnamurthy and Vithayathil 












(1989) 


Penicillium sp. 40 


XynA 


4.7 


2.0 


50 


Kimura et al. (2000) 


Pcnicillium purpurogenum 


XynB 


5.9 


3.5 


50 


Belancic et al. (1995) 


Scl uzophy Hum commune 


Xylanase A 


4.5 


5.0 


50 


Jurasek and Paice (1988) 


Thermomyces lan t tgin osus 


XynA 


4.1 


6.5 


65 


Gomes et al. (1993) 












Schlacher et al. (1996) 


Trichodermo harzionum E58 


20 kD xylanase 


9.4 


5.0 


50 


Wong and Saddler (1992) 


Tricho den no reesei 


XYNI 


5.2 


3.5-4 


n.d. 


Torronen et al. (1992) 


Trichodermo reesei 


XYNII 


9.0 


4.5-5.5 


n.d. 


Torronen et al. (1992) 


Trichodermo viride 


Xylanase IIA 


9.3 


5.0 


53 


Wong and Saddler (1992) 



n.d.: not determined. 
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Table 3 

Family 1 1 endoxylanases of known three-dimensional structure 



Organism Protein 



Bacteria 

Bacillus agarad/iaerens X y 1 a n a se 

Bacillus circukms XLNA 

Bacillus sp. D3 Xylanase" 

Dictyoglomus thermophilum XynB 11 

Fungi 

Aspergillus kawachii XynC 

Aspergillus niger Xyll or A 

Paecilomyces varioti Xylanase 11 

Thermomyces lanuginosus XynA 1 ' 

Trichoderma harzianum E58 Xylanase 

Trichoderma reesei XYNI 

Trichoderma reesei XYNI1 



u Thermophilic protein. 

package (MSI, San Diego) or the FSSP program 
(Holm and Sander, 1996). In addition, the use of 
CLUSTAL to globally align the regions inserted 
between these anchor points allows to highlight 
several key residues in less conserved regions. In 
short, the alignment obtained on the basis of 
sequence similarity also reflects structural similari- 
ties among the endoxylanases. 

The alignment is shown in Fig. 1 . The residues 
are numbered on top; the numbering exceeds the 
length of any individual sequence since it includes 
all gaps assigned by the alignment. The secondary 
structure elements are numbered as shown in Fig. 
2. 

Four boxes (shaded in Fig. 1) are found along 
the whole set of 82 sequences and correspond to 
the PSCRs. Interestingly, the sequence alignment 
provides a pattern of aligned segments and gaps 
consistent with the observed secondary structure 
elements. Indeed, the lowest sequence homology is 
found in the regions between beta strands 
(residues 94-103 between A3 and B3; residues 
111-130 between B3 and A5; residues 238-254 
between B7 and A6). The region corresponding to 
the thumb, (between strands B7 and B8) is, how- 
ever, very well conserved along all the sequences, 
except for Piromyces sp. XYLA. This last enzyme 
is reported to be the inactive form of the protein, 
in contrast to Piromyces sp. XYLB, the active 



PDB code References 



lqh6 


Sabini et al. (1999) 


lxnb 


Wakarchuk et al. (1994a) 




Harris et al. (1997) 


lfsj 


McCarthy et al. (2000) 


Ibkl 


Fushinobu et al. (1998) 


lukr 


Krengel and Dijkstra (1996) 


1 pvx 


Kumar et al. (2000) 


lyna 


Gruber et al. (1998) 


lxnd 


Campbell et al. (1993) 


Ixyn 


Torronen and Rouvinen (1995) 


Ixyp 


Torronen et al. (1994) 



form, that also contains the conserved segment 
defining the thumb region. The cord region (be- 
tween B6 and B9) is also conserved. The PSCR's, 
in particular, correspond to secondary structure 
elements: box 1 to B5, box 2 to B6, box 3 to B8 
and box 4 to the helix. 

The amino terminal regions of the aligned se- 
quences show no similarities in the first 30 or so 
residues. One sequence is particularly long in this 
region (C. acetobutylicum). One of the sequences 
(P. multivesiculatum, ABO 11274) starts only at 
residue 81, and lacks strands Bl, B2 and part of 
A2, suggesting that Bl and B2 may not be neces- 
sary for enzymatic activity. Residue 40 is a G in 
60 sequences, and is missing in 15. It belongs to 
Bl in the structure of T. reesei XYN II (Torronen 
et al., 1994). Its importance is unclear, since Bl is 
missing totally or in part in the indicated 15 
sequences. 

The segment corresponding to B2 shows good 
similarity among the sequences. Position 52 is in 
most cases an aromatic residue: 63 sequences have 
a Y, while 14 show a W or a F. At position 55, W 
predominates (70 sequences), while 62 sequences 
have a D in position 57. Between B2 and A2, two 
long insertions (probably forming a loop) are 
found in the endoxylanases from Pen ici Ilium sp. 
and Pichia slip it is. 50 G's are observed in position 
76 and 50 in position 77. Strand A2 shows low 
similarity except at position 81, where 24 Y and 
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35 M are found. In the loop linking A2 and A3, 
61 G's are observed in position 86 and 67 in 
position 87, but not necessarily together in the 
same sequence. In A3, a highly conserved F or Y 
(79 sequences) is found in position 89 and 75 W in 
93; in this last case, the few replacements are F or 
Y; only one non-conserved replacement (H) is 
observed in P. s tip it is. 

Residue 100 is in all cases a D (27) or N (65); as 
we will see below, this residue is related to the pH 
optimum of the enzyme. At the end of B3, G109 
is present in 79 sequences (it is replaced by a K in 
Bacillus agar adhaer ens) and allows a special twist 
to the chain (Torronen and Rouvinen, 1997). 

A5 is not present in P. stipitis, which shows a 
big gap between 1 10 and 144; in this sequence, B3 
and B5 are probably linked by a short loop. B5 



corresponds to the first box detected by match- 
box, indicating a highly conserved stretch of the 
sequence, particularly YGW (152-154), present in 
all 82 enzymes. Y152 is strongly hydrogen bonded 
to El 67; it seems to be of great functional impor- 
tance since its mutation to F (in Bacillus circulans 
XLNA) leads to a totally inactive enzyme 
(Wakarchuk et al., 1994a). At position 150 21 Cys 
are present, the highest number found at a partic- 
ular location in all the sequences (see below). 

The next box corresponds to B6. It is preceded 
by a highly conserved Pro at 164 (present in 74 
sequences) and Leu 165 (74 sequences). E167 is 
common to all sequences and corresponds to one 
of the catalytic residues (the nucleophile); it is 
followed by two highly conserved Y's (75 in posi- 
tion 168, replaced in the rest of the enzymes by F; 




Fig. 2. Topology diagram of the family 1 1 endoxylanases. The residue numbering refers to that of Fig. 1. The figure is adapted from 
Torronen et al. (1994). 
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and common to all in 169) and by two aliphatic 
hydrophobic residues in positions 170 and 171. Tyr 
169 links both catalytic glutamates, acting as a 
charge stabilizing residue (Torronen and Rouvinen, 
1997), and its mutation to F in XLNA of B. 
circulans leads to a large decrease in activity 
(Wakarchuk et ah, 1994a). 

The 'cord' shows low similarity, except for a Pro 
(found in 70 sequences and missing in 7) in position 
1 85. B9 is a stretch of very low similarity and varies 
considerably in length, but it is followed by the 
third high similarity box, corresponding mainly to 
B8. Between B9 and B8, two highly conserved 
residues are apparent: D204 (in 76 sequences) and 
G205 (in 80 sequences); this last residue is consid- 
ered important in hairpin formation (Torronen and 
Rouvinen, 1997). Y208 is present in all 82 se- 
quences, R215 in 72 and N217 in 69 (this last 
residue is missing in 8 sequences). 

The 'thumb* is well conserved; it has a consensus 
sequence PS1XG where X is almost any residue and 
the others show few and mainly homologous re- 
placements. Pro 219 gives a twist to the strand at 
the beginning of the thumb. B7 is also well con- 
served; residue 227 is a T in 77 sequences and 228 
is F in 81 cases; a consensus sequence QYWSVR 
can be proposed for residues 230-235 (R is re- 
placed by K in one instance and the other residues 
show mainly conservative replacements). 

The loop connecting B7 and A6 differs in length; 
the longest being in the enzyme from P. stipitis, 
which also shows the longest loop between B2 and 
A2. No significant similarities are apparent in A6, 
which is immediately followed by the helix. 

The last box of high similarity comprises the 
carboxyl end of the oc-helix and the following 6-7 
residues. H262 is present in 79 sequences, W263 in 
80 and G270 in 79. 

B4 does not show high similarities, except for 
E287 (boldface in Fig. 1), a residue present in all 
sequences, which corresponds to the acid-base 
glutamate participating in catalysis, and which is 
followed by a highly conserved G (70 sequences). 
The last well defined secondary structure is A4. A 
highly conserved consensus sequence SSGS pre- 
cedes and starts this strand. 

The carboxyl terminus of the sequences is highly 
variable in length. The longest is found in B. 



agar adhaer ens and P. mult ivesicula turn, stretching 
25 and 22 residues, respectively, beyond the end of 
A4. 

3.2. Acidophil)? and thermostability 

A number of family 1 1 endoxylanases have been 
reported to be acidophilic. It has been postulated 
that for xylanases that function optimally under 
acidic conditions, residues spatially adjacent to the 
acid/base catalytic glutamate influence the pH 
optimum (Torronen and Rouvinen, 1995). In par- 
ticular, the substitution of N 100 (underlined in Fig. 
1) by D shifts the pH optimum from 5.7 to 4.6, as 
has been demonstrated by mutational, kinetic, and 
structural studies of N100D in B. circulans XYLA 
(Joshi et al., 2000). This agrees with the mutational 
analysis of xylanase C of A. kawachii, in which the 
single substitution of Asp 100 to Asn at this key 
position dramatically elevates its pH optimum 
from 2 to 5 (Fushinobu et al., 1998). Structural 
studies of xylanase A (or 1) from A. niger led to a 
similar conclusion. In the crystal structure of this 
enzyme of low pH optimum (Krengel and Dijkstra, 
1996), Asp 100 is assigned a critical role. 

An examination of the enzymes of known se- 
quence for which the pH optimum has been deter- 
mined (Table 2), shows that this correlation holds 
true with only one exception. The enzymes showing 
a pH optimum below 5 have D at position 100, 
while N is present in those with pH optima of 5 or 
more. The exception is XynC from F. succinogenes, 
an anaerobic bacterium from the rumen. This 
enzyme shows a pH optimum of 6.5 but has a D 
at position 100. This enzyme has two catalytic 
domains, and the pH optimum reported in Table 
2 corresponds to the native enzyme; the separate 
domains have a pH optimum of 6.0 (Zhu et al., 
1994). 

In conclusion, there is a strong correlation in that 
the residue hydrogen bonded to the general acid/ 
base catalyst at position 100 is asparagine in the 
so-called 'alkaline' xylanases, whereas it is aspartic 
acid in those with a more acidic pH optimum. 

Thermostability is an important issue in the 
properties of endoxylanases, due to their biotech- 
nological applications, particularly in cellulose 
biobleaching. Of the enzymes listed in Table 2, 
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seven of bacterial origin (Bacillus D3, Caldicellu- 
losiruptor sp., Clostridium ster cor avium, Clostrid- 
ium thermocellum, Dictyoglomus thermophilum, 
Streptomyces thermoviolaceus and Ther- 
momonospora fusca) and two from fungi (Pae- 
cilomyces varioti and Thermomyces lanuginosus) 
are considered thermophilic, based on their opti- 
mal temperature and stability at high tempera- 
ture. Table 3 shows that the three-dimensional 
structure of four of these enzymes has been 
determined. 

Thermophilicity and thermostability may be ex- 
plained by a variety of factors and structural 
parameters (Kumar et al., 2000). Of those, the 
importance of S-S bridges and aromatic 'sticky 
patches' can be analyzed by sequence alignment. 
Additional structural features potentially involved 
in thermal stability, such as salt bridges, aromatic 
interactions and entropic effects have been postu- 
lated in family 11 xylanases (Georis et al., 2000). 

Cysteine residues are not very common in these 
enzymes. Of the total number of residues in the 82 
sequences listed in Table 1, only 117 are Cys, a 
0.7%. Twenty-seven of the sequences have no Cys 
and 14 have only one; therefore, at least half of 
the total possesses no disulfide bridges. The three- 
dimensional structure shows that an S-S bridge is 
found in xylanase A from A. niger connecting Cys 
186 to Cys 211, thus attaching the cord to the 
large beta-sheet (B8) (Krengel and Dijkstra, 
1996). These two half-cystine residues are con- 
served in a few other fungal family 1 1 xylanases 
(A. kawachii, Aspergillus aw amor i, A. niger Xyn5, 
Aspergillus tub! gen sis, Cryptococcus sp. and Peni- 
cillium purpurogenum). The xylanases from the 
bacteria Cellvibrio mixtus and Pseudomonas 
fluorescens and of the insect Phaedon cochleariae 
have Cys at positions 188 and 213, which may be 
forming a similar bridge. The presence of this 
disulfide bridge may influence the stability of 
these proteins, although it does not give (at least 
to the enzymes analyzed so far) a thermophilic 
character. 

Of the seven thermophilic xylanases listed, three 
have no cysteines, and the three-dimensional 
structure of the Dictyoglomus thermophilus en- 
zyme shows no S-S bonds, although its sequence 
has 3 Cys. On the other hand, T. lanuginosus and 



P. varioti xylanases, both thermophilic, do have 
an S-S bridge linking Cys 203 (located in B9) and 
Cys 261 (in the a helix). Cochliobolus carbonttm 
Xyl3 and Scliizophyllum commune XylA also pos- 
sess only two Cys and in the same positions; the 
T° optimum of the former has not been reported, 
while the latter has a value of only 50 °C (Table 
2). All these results suggest that S-S bridges are 
unlikely to be of importance in the thermophilic- 
ity of family 1 1 xylanases. 

As pointed out by Turunen et al. (2001), ther- 
mophilicity (activity at high temperature) and 
thermostability do not necessarily depend on the 
same structural factors. They introduced a 
disulfide bridge plus other minor mutations in T. 
reesei Xynll (C203-C261) significantly increasing 
the thermostability without affecting the tempera- 
ture optimum. Wakarchuk et al. (1994b) have 
introduced an S-S bridge at the same position in 
B. circulans xylanase, obtaining similar results. 
Thus, the introduction of S-S bridges to enzymes 
may affect both properties differently. 

Harris et al. (1997) propose that the Bacillus D3 
xylanase (lacking S-S bridges) is stabilized 
through 'sticky patches' between pairs of 
molecules through the interaction of surface aro- 
matic residues. These residues are Tyr 53, 78, 120, 
201, 290, 295 and Trp 207 and 214. When the 
alignment is analyzed, it shows that Tyr 78, 120 
and Trp 207 are unique to this sequence (no 
aromatic residues are present in those positions in 
the remaining 81 sequences), while Trp 214 is 
shared by only 1 sequence (not thermophilic) and 
no other aromatic residues are found in this loca- 
tion. The remaining four residues show no clear 
alignment pattern. Therefore, it can be concluded 
that the 'sticky patch' pattern described for the 
Bacillus D3 xylanase is very unique, and it is 
unlikely to play an important role in stabilizing 
the other thermophilic family 1 1 xylanases of 
known sequence. 

Shibuya et al. (2000) using random gene shuffl- 
ing between a mesophilic (S. Hvidans XlnB) and a 
thermophilic (T. fusca TfxA) enzyme have shown 
the importance of the amino terminal segment of 
the protein in temperature stability. However, if 
the first 50 residues (our numbering) of the ther- 
mophilic enzymes in this study are aligned, no 
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Fig. 3. Results from the factor analysis. Graphical representation of the sequences in the plane of factors 2 and 3. Each point 
corresponds to a sequence, the position on the plot being associated to a percentage of the total variability within the data set. 
Distances separating each position in the plot are directly related to the sequence homology. Groups containing mainly sequences 
of fungi (I, II, III, IV) are marked by filled symbols. 



clear similarity pattern is observed, indicating that, 
again, the proposed thermostabilizing factor may 
be unique to the T. fusca enzyme. 

3.3. Classification of the family 11 xy kinases 

A factor analysis (Explore subprogram of 
matchbox) was performed in order to define 
groups and subgroups among the 82 sequences 
under study. Sequences have been represented in a 
three-dimensional space, each factor being associ- 
ated to a percentage of the total variability between 
the sequences. The first factor is generally trivial, 
and the grouping of the sequences was performed 
in the plane of factors 2 and 3 (Depiereux and 
Feytmans, 1992). Graphical representation of the 
sequences in the plane of factors 2 and 3 (Fig. 3) 
was obtained directly from the program match- 
box and represented via Excel. The distances 
separating each position in the plot are directly 
related to the sequence homology, thus allowing a 



classification. Sequences close in the plane of fac- 
tors 2 and 3 were grouped/clustered, leading to the 
classification presented in Table 4. A first cluster- 
ing, based on the distance between coordinates in 
the plane of factors 2 and 3, led to the definition 
of a total of seven groups. Those groups were 
labeled (A, B, C, I, II, III, IV) a posteriori, on the 
basis of the origin (fungal or bacterial) of the 
sequences belonging to them. The two larger 
groups (I and III), were further divided into sub- 
groups on the basis of the distance between coor- 
dinates in the plane of factors 2 and 3 within the 
group. Group B, although containing many se- 
quences (13) was not divided into subgroups as the 
distribution of points is narrow (factor 2 comprised 
between —0.453 and —0.518; factor 3 comprised 
between 0.015 and 0.087) in comparison to the 
distribution of, for example, group III ( — 0.431 < 
factor 2 < -0.240; - 0.236 < factor 3 < -0.161). 

This method avoids affecting the sequence clas- 
sification by misalignments occurring in a previous 
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step, and represents the sequences on a plane 
taking into account Euclidian distance rather than 
hierarchical merging. The graphical representa- 
tion of the eigenvectors (sequence coordinates) in 
the plane of factors 2 and 3 (Fig. 3) allows to 
classify and group the sequences by eye, since the 
distances separating each position in the plot are 
directly related to differences in the sequences. 
The classification obtained has been validated by 
a clustering (Ward method) performed on the 
sequence coordinates. 

Interestingly, the groups obtained from the fac- 
tor analysis are highly homogeneous, meaning 
that they mostly contain only sequences from 
either bacteria or fungi. It is also noteworthy that 
some bacterial sequences are more homologous to 

Table 4 

Classification of family 1 1 xylanases based on a factor analysis 



sequences of fungal endoxylanases than to se- 
quences of other bacteria (see for example group 
I). 

In this work a robust classification methodol- 
ogy based on factor analysis, without previous 
sequence alignment and without reference to any 
phylogenetic inference is applied for the first time 
to classify a large set of sequences. Interestingly, 
the classification of xylanases presented in Table 4 
compares favorably with previous classifications 
based on smaller sequence datasets. In particular, 
our results are in good agreement with the classifi- 
cation presented by Georis et al. (1999) deduced 
from a phylogenetic tree analysis on 63 sequences. 
According to their study, family 1 1 endoxylanases 
were subdivided into six main groups: three for 
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Sequence of bacterial enzymes are in italics and enzymes produced by protozooa or insects are underlined. For abbreviations see 
Table 1 and Fig. 1. 
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fungi, two for Gram positive bacteria and one for 
Gram negative bacteria. 

A similar subdivision is found here: groups I, II 
and III contain mainly fungal enzymes. The en- 
zymes in groups I and II are mostly the 20 kDa 
enzymes from Ascomyceta and Basicliomyceta. 
Most enzymes of group I exhibit a basic pi. Those 
in group II show an acidic pi. Enzymes of group 
III are mainly produced by anaerobic fungi. Like 
in the classification proposed by Georis et ah (1999) 
two enzymes produced by F. succinogenes (XyCA 
and XyCB), an anaerobic Gram-negative bac- 
terium, also fall in this group. 

In addition to those three groups associated 
mainly to fungal enzymes, a fourth group that was 
not present in previous classifications emerges from 
the present factor analysis. It only contains two 
enzymes. One is produced by C. carbonum (Xyl3) 
and is clearly distinct from Xyl 1 and Xyl2 produced 
by the same fungus. In the classification proposed 
by Georis et al. (1999), this enzyme is isolated in 
the unrooted phylogenetic tree. The second enzyme 
in group IV is produced by P. stipitis (XynA); this 
enzyme has not been included in any previous 
classification. 

Bacterial enzymes are mainly divided into three 
groups (A, B, C). Group A contains mainly en- 
zymes produced by members of the Actinomyc- 
etacecte and the Bacillaceae families, strictly aerobic 
Gram-positive bacteria. Groups B and C are more 
closely related and contain mainly enzymes from 
anaerobic Gram-positive bacteria, such as those 
from Clostridium or Ruminococcus, which usually 
live in the rumen. Two bacterial enzymes, XylE and 
Xyl A xylanases from P . fluorescens and C. mixtus, 
respectively, strictly aerobic Gram-negative bacte- 
ria, are found in subgroup Ic. In terms of sequence 
similarities, those two Gram-negative bacterial en- 
zymes more closely resemble xylanases produced by 
fungi (group I) than other Gram-positive bacterial 
enzymes (e.g. groups A, B, C). They were also 
classified in a distinct group by Georis et al. (1999). 

3.4. Biotecimological significance 

Xylanases are finding an increasing number of 
applications, both alone and in combination with 
other enzymes. Among them are cellulose pulp 



biobleaching (Buchert et al., 1994), bread-making 
(Courtin et al., 1999) and saccharification of ligno- 
cellulosic biomass (Lee, 1997). These applications 
require enzymes capable of operating under specific 
and often unnatural conditions. Parameters of 
particular interest are thermostability and pH opti- 
mum. Biobleaching, for instance, requires ther- 
mostable and alkali-stable enzymes. Family 1 1 
xylanases may be of particular interest in 
biobleaching due to their smaller size; a fact which 
may facilitate penetration in the cellulose fiber 
network. 

Optimizing enzyme properties for a particular 
application can be achieved by random and site-di- 
rected mutagenesis. Arase et al. (1993) have 
achieved a significant stabilization of Bacillus 
pumihis XynA by random mutagenesis. Four heat- 
resistant mutants were isolated, and the stabilizing 
mutations were found to be clustered in the N-ter- 
minal region. In three of the four mutants, a 
mutation of G92 to S or to D was observed. 
Residue 92 is located in the A3 strand, and it is 
always a polar or charged residue except in B. 
pumilus and Penicillium sp. where it is a G (Fig. 1); 
the introduction of an S or a D at that position may 
allow the formation of a stabilizing hydrogen bond. 

An example of the use of site-directed mutagen- 
esis is given by Turunen et al. (2001). They have 
introduced a disulfide bridge by means of the 
mutations S203C-N261C, thus increasing signifi- 
cantly the half-life of the enzyme at 65°. As shown 
in Fig. 1, the thermophilic xylanase from T. lanug- 
inosa possesses this bridge. 

It has proved difficult to manipulate the proper- 
ties of an enzyme in a predictable manner (Tor- 
ronen and Rouvinen, 1997). However, for the 
design and protein engineering of endoxylanases 
with properties suitable for their different applica- 
tions, a good knowledge of its sequence and struc- 
ture is a necessary condition. The information and 
analysis presented in this publication should be 
useful for this purpose. 
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The crystal structures of thermophilic xylanases from 
Chaetomium thermophilum and Nonomuraea flexuosa were 
determined at 1 .75 and 2. 1 A resolution, respectively. Both 
enzymes have the overall fold typical to family 1 1 xylanases 
with two highly twisted p-sheets forming a large cleft. The 
comparison of 12 crystal structures of family 11 xylanases 
from both mesophilic and thermophilic organisms showed 
that the structures of different xylanases are very similar. The 
sequence identity differences correlated well with the struc- 
tural differences. Several minor modifications appeared to be 
responsible for the increased thermal stability of family 1 1 
xylanases: (a) higher Thr : Ser ratio (b) increased number of 



charged residues, especially Arg, resulting in enhanced polar 
interactions, and (c) improved stabilization of secondary 
structures involved the higher number of residues in the 
P-strands and stabilization of the a-helix region. Some 
members of family 1 1 xylanases have a unique strategy to 
improve their stability, such as a higher number of ion pairs 
or aromatic residues on protein surface, a more compact 
structure, a tighter packing, and insertions at some regions 
resulting in enhanced interactions. 

Keywords: xylanase; glycoside hydrolases; family 1 1 ; 
thermostability. 



Xylanases (EC 3.2.1.8) are glycoside hydrolases that cata- 
lyze the hydrolysis of internal (3-1,4 bonds of xylan, the 
major hemicellulose component of the plant cell wall. The 
enzymatic hydrolysis of xylan has potential economical and 
environment-friendly applications. Xylanases can be used in 
bleaching of pulp to reduce the use of toxic chlorine- 
containing chemicals [1] or to improve the quality of animal 
feed [2]. In addition, there are applications in the food and 
beverage industry [3]. Therefore, attention is focused on 
discovery of new xylanases or improvement of existing ones 
in order to meet the requirements of industry such as 
stability and activity at high temperature and extreme pH. 

The xylanases that have been structurally characterized to 
date can be classified into the glycoside hydrolase families 10 
and 11, corresponding to former families F and G, 
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respectively [4]. Family 10 enzymes have an (a/P) 8 barrel 
fold with a molecular mass of approximately 35 kDa. 
Family 1 1 xylanases are somewhat smaller, approximately 
20 kDa, and their fold contains an a-helix and two (3-sheets 
packed against each other, forming a so-called (3-sandwich. 
Due to the industrial applications of xylanase, both xylanase 
families are well studied. In this paper, we focus on 
xylanases in family 1 1. 

To date, the crystal structures of family 1 1 xylanases are 
available from several organisms: Trichoderma harzianum 
[5], Bacillus circulans [5-7], Trichoderma reesei [8,9], Asper- 
gillus niger [10], Thermomyces lanuginosus [11], Aspergillus 
kawachii [12], Bacillus agaradhaerens [13], Paecilomyces 
varioti [14], and Dictyoglomus thermophilum [15]. Three of 
these, T. lanuginosus, P. varioti, and D. thermophilum are 
from thermophilic organisms. In addition, a low-resolution 
structure has been reported for thermostable Bacillus D3 
[16] but no PDB coordinates are available. Very recently, 
the structures of two new xylanases from Streptomyces sp. 
S38 [ 1 7] and Bacillus suhtilis B230 [ 1 8] have also been solved. 
A disulphide bridge has been suggested to be one reason for 
the enhanced thermal stability of T. lanuginosus and 
P. varioti xylanases [11,14]. A greater proportion of polar 
surface and a slightly extended C- term in us together with an 
extension of P-strand A 5 are thought to increase the stability 
of D. thermophilum xylanase [15,19]. Despite all these 
studies, the structural basis for the thermostability of family 
1 1 xylanases is not well understood. 

We report here the three-dimensional structures of two 
new members of family 1 1 xylanases. The crystal structure 
of the catalytic domain from Chaetomium thermophilum 
xylanase XynllA (CTX) has been determined at 1.75 A 
resolution and the catalytic domain from Nonomuraea 
flexuosa xylanase Xynl 1 A (NFX) at 2.1 A resolution. CTX 
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and NFX act optimally at 65-80 °C; NFX, in particular, is 
a remarkably stable enzyme, having a half-life of 273 min at 
80 °C and even 28 min at 100 °C In addition, NFX is 
active at pH 8. The crystal structures of CTX and NFX 
allowed us to make detailed comparison of 12 xylanases, 
five from thermophilic organisms. This gives a more reliable 
comparison of the enzyme structures in relation to their 
thermostability than earlier studies and helps us to under- 
stand the molecular basis of the thermostability of these 
industrially relevant enzymes. 

Materials and methods 

Protein purification 

The catalytic domains of C. thermophilum and N.fiexuosa 
expressed from Trichoderma reesei were purified from 
samples kindly provided by A. Mantyla (ROAL, Rajamaki, 
Finland). GenBank accession codes are AJ508931 and 
AJ508952 for C. thermophilum xylanase and N.fiexuosa 
xylanase, respectively. C. thermophilum xylanase was 
expressed in T. reesei as a full-length enzyme containing 
235 amino acids and N. fiexuosa as a construct coding 
mainly the catalytic domain (220 amino acids). However, it 
is likely that an extracellular protease has cleaved off the 
C-terminal tail of C. thermophilum xylanase (shortening was 
seen in SDS/PAGE) and possibly also several C-terminal 
residues outside the catalytic core of N. flexuosa xylanase. 
As determined by SDS/PAGE, the C. thermophilum xyla- 
nase was present as a ~ 26 kDa protein and the N flexuosa 
xylanase as « 28 kDa. Both xylanases were purified by 
cation exchange chromatography (CM Sepharose Fast 
Flow; Amersham-Pharmacia Biotech, Uppsala, Sweden) 
and hydrophobic interaction chromatography (Phenyl 
Sepharose Fast Flow, Amersham-Pharmacia Biotech). 
The procedure was essentially the same as that described 
for T. reesei xylanase [20]. The C. thermophilum xylanase 
was further purified on a Q Sepharose High Performance 
column (Amersham-Pharmacia Biotech), equilibrated with 
10 him citrate buffer (pH 4). A linear gradient of 0-0.25 m 
NaCl in 10 itlm citrate buffer (pH 4) was used to elute the 
enzyme. 

Enzyme assay 

The half-life of each xylanase was determined at different 
temperatures in 50 itim citrate/phosphate buffer, 
0.01 mg-mL -1 bovine serum albumin, pH 6.0. After incu- 
bations at each temperature, the residual activity of 
xylanase was determined by measuring the amount of 
reducing sugars liberated from 1% birchwood xylan [21]. 
The half-lives were determined for enzymes produced in 
T. reesei. 

Crystallization and data collection 

The catalytic domains of Chaetomium xylanase (CTX) and 
Nonomuraea xylanase (NFX) were crystallized by a hang- 
ing-drop vapor-diffusion method at room temperature. 
CTX crystals were obtained in 8 mL droplets containing 
approximately 5 mg-mL" 1 protein {A 2 w = 1 corresponds to 
the concentration 0.37 mg-mL 1 of protein) 0.7 m ammo- 



nium sulfate and 0.05 m Hepes at pH 7.2. Crystals of NFX 
were grown in 8 mL droplets containing 7 mg-mL 
protein, 0.4 m ammonium sulfate and 0.05 m sodium 
acetate 1 at pH 6.0. In both cases, the droplets were 
equilibrated against reservoir solution with a twofold higher 
concentration of ammonium sulfate and buffer. When 
sodium acetate buffer was used instead of Hepes, CTX also 
crystallized at pH 6-7, but these crystals diffracted only up 
to 3-4 A. Similarly with Hepes buffer, NFX crystallized at 
pH 7-8, but the crystals were not suitable for X-ray 
analysis. High quality crystals of CTX (dimensions 
0.5 x 0.2 x 0.2 mm) and NFX (0.3 x 0.2 x 0.2 mm) 
appeared in the drop after a few days and reached their 
final size in two weeks. 

Data were collected on a Rigaku RU-200HB rotating 
anode X-ray generator operating at 50 kV and 100 mA 
equipped with an Osmic Confocal Optics and an RAXIS- 
IIC imaging plate detector. Initially, the data sets of CTX 
and NFX crystals were collected at room temperature at 
resolutions of 2.4 and 2.3 A, respectively. Later, higher 
resolution data sets were collected at 120 K at resolutions of 
1.75 A and 2.1 A, respectively. Crystals from both xylan- 
ases were soaked in cryoprotectant solution containing 30% 
glycerol. The diffraction images were processed with denzo 
software and the data were scaled with scalepack software 
[22]. The space groups were defined using xprep program 
(shelx software package). CTX crystals belonged to the 
orthorhombic space group P2!2,2 with unit cell parameters 
a = 108.24 A, h = 57.15 A, and c = 65.68 A. The asym- 
metric unit contained two molecules. NFX crystals were 
hexagonal with unit cell parameters a,b = 37.03 A, and 
c = 191.81 A and they belonged to the space group P6i. 
The asymmetric unit of NFX crystals contained only one 
molecule. The data collection statistics are presented in 
Table 1. 

Structure solution and refinement 

Both structures were determined using the molecular 
replacement method with the AMoRe program [23]. The 
search model was Trichoderma reesei xylanase II (TRX II, 
PDB code 1ENX). Sequence identities of CTX and NFX 
(digestion site determined with mass spectrometer) with 
TRX II are 63% and 51%, respectively. The molecular 
replacement solutions were initially calculated from the 
room temperature data sets and the models were further 
improved with the high-resolution data sets. Iterative cycles 
of refinement and manual fitting were carried out using 
programs cns [24] and o [25]. To monitor the progress of 
the refinement, a total of 10% of the reflections were set 
aside for the R-free calculations. Refinements were carried 
out using the maximum-likelihood method with bulk- 
solvent corrections. The water molecules of the CTX 
model were positioned automatically with the wARP [26] 
but were also checked and finalized with the O. The water 
molecules of NFX were positioned with cns and o. 
Refinement statistics of the final models are presented in 
Table 1. The CTX model contained four sulfates and two 
of them at special positions. As the refinement programs 
are not able to refine covalent bonds at special positions, 
only the sulfur atoms of these two sulfates were modeled. 
However, the electron density map showed clearly the 
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CTX NFX 



Data collection 

Resolution range (A) (overall/last shell) 99-1.75 (1.81-1.75) 99-2.1 (2.18-2.10) 

No. of total observations 190677 48587 

No. of unique reflections 40787 8541 

l/I(cr) 27.1 17.9 

Completeness of data (%) (overall/last shell) 97.3 (90.9) 97.6 (92.8) 

Rsym (%) (overall/last shell) 7.7 (30.2) 10.7 (29.3) 

Refinement 

Resolution range (A) 99-1.75 99-2.1 

No. of reflections F > 0 a 39243 8135 

Rr„cop(%) 17.9 14.6 

R frcc (%) 21.6 20.9 

No. of non-hydrogen atoms 3637 1787 

Protein 3015 1544 

Water 603 183 

Carbohydrate - 50 

Ligand (glycerol) 6 6 

Ion 13 4 

Average B (A 2 ) 20.7 19.9 

Main chain 16.5 17.1 

Side chain 18.6 18.2 

Water 35.8 31.6 

Carbohydrate - 42.3 

Ligand (glycerol) 29.3 32.6 

Ion 67.9 27.0 

Rmsd from the ideal 

Bond lengths (A) 0.005 0.006 

Bond angles (°) 1.475 1.439 



shape of the tetrahedral corresponding to the sulfate ion. 
In the final model, two conformations of residues A 10, B10 
and A 123 of CTX were refined. There were also signs of 
other conformations of surface residues A32, A37, A38, 
A 62, A 70, A83, A 1 10, B32, B37, B57, and B70 of CTX. 
The final model was evaluated with procheck software 
[27]. 

Comparison of family 11 xylanases 

The coordinates of 10 different family 11 xylanases were 
available in PDB. First, the Cot atoms of all xylanase models 
were roughly superimposed with the O program. To create 
the final multiple alignment, stamp software [28] was 
applied. The secondary structures were assigned with the 
dssp software [29]. 

The solvent accessible surface areas of xylanases were 
calculated with the naccess software [30] using a 1.4 A 
probe. Van der Waals volumes were calculated using 1 .4 A 
probes and without probes with the voidoo program [31]. 
Numbers of hydrogen bonds were calculated for all 
xylanases using hbplus routine [32] with the default 
parameters for distances and angles. A salt bridge was 
assigned, when the distance between the two atoms of 
opposite charge was less than 4 A [33]. In all calculations, 
water molecules and hetero-atoms were excluded from the 
coordinate files and the chains were split. 



Mass spectrometry 

Mass spectra of CTX and NFX were measured by a 
Bruker BioAPEX II 47e FTICR mass spectrometer 
(Bruker Daltonics) using positive mode electrospray ioni- 
zation (ESI). This instrument is equipped with a passively 
shielded 4,7-T superconducting magnet, cylindrical infinity 
ICR cell and external electrospray ion source (Analytica of 
Branford). Aliquots of CTX and NFX were diluted with a 
methanol/water (1:1, v/v) solvent, followed by glacial 
acetic acid (1%) to obtain denaturing solution conditions 
for efficient proto nation in ESI. The final concentration for 
both proteins was approximately 0.5 mgmL" 1 . Samples 
were infused into the ESI-source by a syringe infusion 
pump (Cole-Parmer) at a rate of 50 uL h" 1 . Ionization 
voltage was -3.7 kV and ions were accumulated for 2 s in 
an RF-only hexapole ion guide before they were trans- 
ferred into the ICR cell for excitation and detection. The 
drying gas in a spraying process was pure nitrogen gas. All 
data were acquired and processed with a Bruker 
xmass 5.0.1. software. The mass spectra were calibrated 
against an acetonitrile-based ES Tuning Mix (Hewlett 
Packard) by peptide peaks in the m/z range of 200-3000. 
Molecular masses of observed proteins were calculated as 
average values over the charge state distributions using the 
esi mass macro program. Relative abundances of glycosyl- 
ated and nonglycosylated protein species were calculated 
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using the absolute intensities of the peaks appearing in the 
ESI-spectra. 

Results and discussion 

Overall structure of C thermophilum xylanase 

The overall structure of xylanase from C. thermophilum 
(CTX) was dominated by one a-helix and two strongly 
twisted p-sheets, which were packed against each other. This 
is the protein fold of family 1 1 xylanases. According to 
Torronen et al. [8], the shape of the molecule resembles a 
right hand: two p-sheets and the a-helix form fingers and 
a palm, a long loop between the B7 and B8 strands forms a 
thumb, and a loop between the B6 and B9 strands forms a 
cord (Fig. 1 A). 

The final model of CTX contained residues 1-191 for 
both molecules in the asymmetric unit (labeled A and B), 
The first residue, glutamine, was deaminated and cyclized to 
pyrrolidone carboxylic acid. When the ESI mass spectrum 
of CTX was measured, the unique molecular masses, 
21 479 Da and 21 682 Da, were obtained. Assuming that 
CTX contains 196 residues, the calculated molecular mass 
would be 2 1 478 Da, which agrees well with the lower mass 
obtained. The difference between the two obtained masses 
was 203 Da corresponding to one N-acetyl-glucosamine 
(GlcNAc). There is one potential iV-glycosylation site 
(Asn62) in the sequence of CTX, but there was no clear 
sign of glycosylation in the electron density map. It is 
possible that only the protein molecules without GlcNAcs 
had been crystallized or that the GlcNAc is disordered. 
According to the mass spectrum, approximately 20% of the 
material did not contain GlcNAc or alternatively, the 
GlcNAc had been lost. 

In the crystal structure, a glycerol molecule was located in 
the active site of molecule A, but was not observed in 
molecule B. The cryoprotectant soaking solution was most 
likely the source of glycerol, which was packed against 
Trpl9 by stacking interactions and was hydrogen-bonded 



to carboxyl group of Pro 127. In addition, Argl23 had two 
conformations in molecule A and in one of the conforma- 
tions the guadinine group of the Arg was located towards 
the hydroxy! group of the glycerol. The rms deviation 
between the A and B molecules of CTX was 0.8 A, 

The crystal structure showed four sulfate ions and a 
calcium ion in the asymmetric unit. The calcium ion was 
located between molecules A and B exactly on the 
noncrystallographic axis. The calcium ion interacted with 
side chains Oy of ThrlO from molecule A and B, both of 
which clearly had two conformations in the electron density 
map. Two of the sulfate ions were located exactly on the 
crystallographic axes. In addition to these two sulfate ions, 
which are attached to Arg residues A27 and B27, there are 
two other sulfates, which are attached to Arg residues A68 
and B68. Due to the crystal packing, the enzyme resembles a 
tetrameric assembly (Fig. IB). Four sulfate ions link 
molecule A to symmetry molecule D and correspondingly 
molecule B to symmetry molecule C. However, according to 
the dynamic light scattering measurements, the protein was 
a monomer. Therefore, the sulfate ions from the crystal- 
lization solution might have been involved in this 'tetra- 
merization' process. It is possible that tetramers were 
assembled first and their stacking then led to crystal 
formation in the high salt concentration. 

Overall structure of N. flexuosa xylanase 

The protein fold of the xylanase from N. flexuosa (NFX) 
was the same as that of CTX and other family i 1 xylanases 
(Fig. 2A). The crystal structure of NFX contained 197 
residues with a sequence GNPGNP at the C-terminus. The 
sequence GNPGNP seems to be a part of the C-terminal tail 
of the full-length NFX with total 301 residues. The amino 
acids of the C-terminal tail were sticking out from the 
model, but if the C-terminus of NFX is excluded, the 
enzyme is slightly more compact than CTX, probably due 
to short deletions. The electron density map showed that 
there was Ala at position 73 instead of Gly, which had been 




Fig. 1. CTX. (A) The overall structure of CTX. Glycerol and catalytic glutamates are shown in the active site. (B) A tetrameric assembly with 
sulfate ions. Molecules A and B are shown in white and symmetry molecules C and D in blue. 
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Fig. 2. NFX. (A) The overall structure of NFX with a glycerol molecul 
(B) The representative 2F 0 -F C electron density map from the final mode 
of 1.5 a. 

determined earlier by sequencing. This might be a sequen- 
cing error or mutation in the T. reesei strain. 

The crystal structure of NFX revealed that the enzyme 
has a single /V-glycosylation site at Asn7, where two 
yV-acetyl-glucosa mines and two ma mioses were attached 
(Fig. 2B). Carbohydrates are orientated in the same way as 
the backbone of (^strand Bl and therefore they are almost 
like an extension of the p-strand. yV-glycans are known to 
have a stabilizing effect and they may prevent the aggrega- 
tion of unfolded protein molecules [34]. However, NFX 
contains jV-glycans only when the enzyme is expressed in 
T. reesei. In the ESI mass spectrum, we were able to see 
six peaks in every charge state, corresponding to hetero- 
geneously glycosylated molecules. From the peaks of the 
most abundant charge state distribution corresponding to a 
mass of 23491 Da, we concluded that most of the material 
contained 206 residues, two GlcNAcs and three mannoses. 
The difference of 22 Da between the calculated (22577 Da) 
and the measured (22599 Da) was probably due to the 
formation of sodium adduct. 

On the protein surface, an acetate ion was located 3.2 A 
away from Serl87 Oy and 2.8 A from Ser36 Oy. The 
glycerol molecule was again found in the active site. It was 
slightly differently located in the active site of NFX than it 
was in the active site of CTX. The glycerol was packed 
against Trp20 (corresponding to Trpl9 in CTX), but it was 
slightly deeper in the active site. In NFX, Tyrl70 and Tyr78 
interacted with hydroxyl groups of glycerol. When this 
complex structure is compared with the complex structure 
of T. reesei xylanase with epoxyalkyl xylosides [35], the 
glycerol appears to mimic the binding of the xylose ring in 
the active site. The binding of glycerol to a single site may 
suggest that this site is the strongest binding subsite for the 
xylose subunits of xylan. 

Structural comparison of family 11 xylanases 

The C. thermophiium (CTX) and N.flexuosa (NFX) xylan- 
ases are much more thermostable than the mesophilic 
T. reesei xylanase II (TRX IJ). While TRX II was rapidly 



: in the active site. Carbohydrates attached to Asn7 are shown in gray sticks, 
of NFX. The figure shows the density of carbohydrates, contoured at a level 

inactivated at 55-60 °C, CTX was stable up to 60-65 °C 
and NFX was stable at 80 °C and it had some stability even 
at 90-100 °C (Table 2). However, the reasons for the 
significantly higher thermostability of NFX and CTX are 
not readily evident at the structural level, as they both 
resemble TRX II xylanase very closely. Because a number of 
solved three-dimensional structures of family 1 1 xylanases 
are now available for both mesophiles and thermophiles, we 
made a detailed comparison of the structures of these 
enzymes. Twelve structures used in the comparison are 
summarized in Table 3 and the sequence alignment is 
shown in Fig. 3. 

According to the sequence homology of family 1 1 
xylanases, which have a solved crystal structure, the 
enzymes can be divided into four groups (Fig. 4). The first 
group is formed from acidophilic xylanases A NX, AKX, 
and TRX I. The second group contains alkalophilic BAX 
and highly thermophilic DTX (sequence identity 58%). 
The third group is formed from thermophilic NFX and 
mesophilic BCX (sequence identity 59%). The fourth group 
contains mesophilic THX and TRX II together with 
thermophilic PVX, TLX, and CTX. BAX, DTX, BCX, 
and NFX are all from bacterial sources, whereas the others 
are fungal enzymes. 



Table 2. Half-lives of TRX II, CTX, and NFX. 



Temperature (°C) 


TRX II (min) 


CTX (min) 


NFX (min) 


50 


1480 






55 


20 






60 


2 


1500 




65 




58 




70 




15 




75 




7 


1500 


80 




4 


273 


85 






148 


90 






88 


95 






39 


100 






28 
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Table 3. Summary of the crystal structures used in comparison. 







PDB 


Tern perul u re 


nH 


Resolution 


M e u s u re men I 






Orgii nisui 


Code 


code 


preferc ncc 


preference 


(A) 


T (K) 


Li gun d 


Rofcro ncc 


/V. fiexuosa 


NFX 


1M4W 


Thcrmophilc 




2.1 


120 


Glycerol 


This paper 


C. thevmophilum 


CTX 


1H1A 


Thcrmophilc 




1.75 


120 


Glycerol in 
molecule A 


This paper 


D. thennophiluni 


DTX 


1F5J 


Therniophile 




1.8 


110 




[15] 


T. lamtginosus 


TLX 


] YNA 


Thermophile 




1.55 


295 




|H] 


P. varioti 


PVX 


1PVX 


Therniophile 




1.6 


295 




[14] 


T. rcesei 


TRX 11 


1ENX 






1.5 


295 




[8] 


B. circulans 


BCX 


1XNB 






1.5 


295 




[5] 


T. harzkmum 


THX 


1XND 






1.8 


295 




[5] 


T. reesei 


TRX 1 


1XYN 




Acidophile 


2.0 


295 




[9] 


A. kawachii 


AKX 


1BKI 




Acidophilc 


2.4 


295 




[12] 


A. niger 


ANX 


1UKR 




Acidophile 


2.0 


295 




[10] 


B. agar adhaer ens 


BAX 


1QH7 




Alkalophile 


1.8 


100 


p-D-xylopyranoside 


[13] 



When three-dimensional structures of family 1 1 xylanases 
are superimposed, their rmsd (root-mean-square deviation) 
values correlate well with the sequence similarities. The 
sequence identities of family 1 1 xylanases, including all 
molecules in the asymmetric unit, are shown in the function 
of rmsd values in Fig. 5. The sequence identity range for 
different family 1 1 xylanases was 31-97% and rmsd range 
was 0.2-1.4 A. The natural structural differences can be 
seen in the upper part of the figure (sequence identity 
100%). The high rmsd value 0.8 A, which exists between 
molecules A and B of CTX, is partly due to movements 
induced by glycerol binding. Therefore, we note that some 
of the structural differences among family 1 1 xylanases 
are caused by ligand binding. Both NFX and molecule A 
of CTX contain the glycerol while BAX contains the 
P-D-xylopyranoside in the active site. For the structural 
comparisons, other family 1 1 xylanases were chosen without 
ligand s. 

The superimposition of three-dimensional structures 
confirmed the subgroups of xylanases based on sequence 
similarities. For example, the lowest rmsd value of thermo- 
philic NFX is with mesophilic BCX (0.78 A), both belong- 
ing to group 3. NFX had a low rmsd (0.82 A) with molecule 
A of thermophilic CTX, showing that groups 3 and 4 are 
closely related. Molecule A of CTX has the lowest rmsd 
with mesophilic THX (0,71 A) and molecule B of CTX with 
mesophilic TRX II (0.75 A), all belonging to group 4. In 
group 1, alkalophilic BAX has the lowest rmsd with 
thermophilic DTX (0.79 A between molecule A of BAX 
and molecule A of DTX). 

As the crystal structures of mesophilic and thermophilic 
xylanases are very similar, it is likely that an array of minor 
modifications forms the structural basis for enhanced 
stability in thermophilic xylanases. Therefore, several 
factors, which are thought to be responsible for thermosta- 
bility, were compared between thermophilic and mesophilic 
family 1 1 xylanases. The alkalophilic BAX was not included 
in the same group as other mesophilic xylanases, because its 
functional properties seem to be different. Bacillus agarad- 
haerens grows optimally at unusually high pH (over 10). On 
the other hand, acidophilic TRX I, AKX and ANX would 
be considered as a separate group of acidic xylanases, but in 



our comparisons they were included in the mesophiles as a 
large number of mesophilic xylanases are slightly acidic in 
their activity profiles. The C-terminal tail (GNPGNP) of 
NFX was excluded. 

Sequence properties 

Frequencies of all 20 amino acids were computed for 
thermophilic and mesophilic family 1 1 xylanases (Table 4). 
It is obvious that the comparison of amino acid contents 
suffered from the low number of sequences and thus, 
statistical methods were not used to analyze the data. 
However, this comparison may still reveal some important 
trends and some of the trends in the amino acid frequencies 
could be related to the thermostability of xylanases. 

There was found an increased occurrence of argi nines in 
the thermophilic xylanases. Large-scale sequence compari- 
sons have shown that thermophilic proteins contain more 
argi nines on the protein surface than mesophilic proteins 
[36-38]. The effect of the large-scale increase in the number 
of argi nines was tested experimentally in T. reesei xylanase 
II [39]. These results showed that the introduction of five 
arginines into the Ser/Thr surface increased considerably the 
thermotolerance in the presence of the substrate. 

Another trend is that in thermophilic xylanases the 
frequency of Ser decreases and correspondingly the fre- 
quency of Thr increases (Table 4). Ser — > Thr mutation was 
one of the stabilizing mutations found by the early study of 
Argos et al. [40]. For thermophilic proteins, the decrease in 
the frequency of Ser but not the increase of Thr was 
observed by Kumar et al. [38]. These authors found that in 
thermophilic proteins, Arg and Tyr are more frequent, while 
Cys and Ser are less frequent. One possible explanation for 
this finding in xylanases is that the increase in the Thr : Ser 
ratio in P-strands (Table 5) improves the P-forming pro- 
pensities. Over half of the residues in the family 1 1 xylanases 
are located in the p-strands. 

In thermophilic xylanases, the frequency of asparagines is 
slightly lower (Tables 4 and 5). Asn has a low p-forming 
propensity, and thus might be avoided in the p-strands of 
thermophilic xylanases. The highly thermostable xylanase 
DTX showed a decreased frequency of Gly (Table 4) 
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Al Bl B2 A2 A3 B3 

TRXI asiNYDQNyqt ggQVSYSp SNTGFSVnw ntQDDFWGVGWt 

AKX agiNYVQNyngaLGDFTyd esAGTFSMywedgvSSDFVVGLGWt 

ANX giNYVQNyngnLGDFTyd esAGTFSMywedgvSSDFWGLGWt 

BCX astDYWQNwtdggglVNAVng&GGNYSVnw srtTGNFWGKGWt 

DTX altsnasgtfdgYYYE&wkd tgNTTMTvytOGRFSCqw snIWNALFRTGXk 

BAX qivtdnsignhdgYDYEFwkdaggSGTMIlnhGGTFSAqw nnVNNILFRKGKk 

NFX dttitqnqtgydngYFYSFwtdapgTVSMTlhsGGSYSTsw rnTGNFVAGKGWs 

PVX gttphsegwhdgYYYSWwsdgggDSTYTnnsGGTYEItv g nGGNLVGG KGWn 

TLX qttpnsegwhdgYYYSWv/sdggaQATYTnleGGTYEIsw gdGGNLVGGKGWn 

THX qqtigpgtgysngYYYSYvmdghaGVTYTnggGGSFTVnw snSGNFVAGKGWq 

TRXI I qtiqpgtgynngYFYSYvmdghgGVTYTngpGGQFSVnw snSGNFVGGKGWq 

CTX qtXtssatgthngYYYSFwtdgqgNIRFNlesGGQYSVtw sgNGNWVGGKGWn 

A5 B5 B6 CORD- 

TRXI t gsSAPINF gGSFSvNSGTGLLSVYGWSTNFLVEYYZHEDNHNYF a 

AKX t gsSHAITY S AE YS a S G S SS YLAVYGWWY PQAE YYI VEDYGD Y Npc S 

ANX t gsSNAXTY sAEYSaSGSASYLAVYGWVNYPQAEYYIVEDYGDYNpcS 

BCX t gspFRTINYnaGVWA PNGNGYLTLYGWTRSPLIEYYVVDSWGTYRp 

DTX yrvqn wqalgTIRITY sATYN PNGNSYLCIYGWSTNPLVEFYIVESWGNWRp p 

BAX fnetqthqqvgNMSINY gANFQ PNGNAYLCVYGWTVDPLVEYYIVDSWGNWRp p 

NFX t ggRRTVTY nASFN PSGNAYLTLYGWTRNPLVEYYIVESWGTYRp 

PVX p glnARAIHF tGVYQ PNGTSYLSVYGWTRNPLVEYYIVENFGSSNpSS 

TLX p glnARAIHF eGVYQ PNGNSYIAVYGWTRNPLVEYYIVENFGTYDpSS 

THX p gtkNKVlNF sGSYN PNGNSYLSIYGWSRNPLIEYYIVENFGTYNpst 

TRXIX p gtkNKVINF SGSYN PNGNSYLSVYGWSRNPLIEYYIVENFGTYNpst 

CTX p gt.dNRVINY tADYR PNGNSYLAVYGWTRNPLIEYYVVESFGTYDpst 

59 B8 THUMB B7 A6 

TRXI QGTVKGTVTSDGATYTIWENTRVWEPSX qGTATFNQYISVRNspr tSGTVTVQ 

AKX SATSLGTVYSDGSTYQVCTDTRTNEPSI tGTSTFTQY FSVREstr tSGTVTVA 

ANX SATSLGTVYSDGSTYQVCTDTRTNEPSI tGTSTFTQYFSVREs tr tSGTVTVA 

BCX TGTYKGTVKSDGGTYDIYTTTRYNAPSIdgDRTTFTQYWSVRQskrptgsNATITFT 

DTX GATSLGQVTIDGGTYDI YRTTRVNQPS I vGTATFDQYWS VRTs kr tSGTVTVT 

BAX GATPKGTI T VDGGTY DI Y ETLR VNQP S X kG I ATFKQYWS VRRs kr tSGTISVS 

NFX TGTYKGTVTTDGGTYDIYETWRYNAPSI eGTRTFQQFWSVRQqkr tSGTITXG 

PVX GSTDLGTVSCDGSTYTLGQSTRYNAPSI dGTQTFNQYWSVRQdkr sSGTVQTG 

TLX GATDLGTVECDGSIYRLGKTTRVNAPSI dGTQTFDQYWSVRQdkr tSGTVQTG 

THX GATKLGEVTSDGSVYDIYRTQRVNQPSI iGTATFYQYWSVRRnhr sSGSVNTA 

TRXIX GATKLGEVTSDGSVYDIYRTQRVNQPSI iGTATFYQYWSVRRnhr sSGSVNTA 

CTX GATRWGSVTTDGGTYNIYRTQRVNAPSI eGTKTFYQYWSVRTs kr tGGTVTMA 

HELIX B4 A4 

TRXI HHFNAWASLGL hLGQMNYQWAVEGWGGSGSAsqsvsn 

AKX HHFNFWAQHGF gWSDFNYQVMAVEAWSGAGSASVtis 

ANX NHfNFWAHHGF gWSDFNYQWAVEAWSGAGSAsvtis 

BCX HHVNAWKSHGMnlGSNWAYQVMATEGYQSSGSSnvtvw 

DTX DHFRAWANRGL nLGTIDQITLCVEGYQSSGSAnitqntf sqss 

BAX NHFRAWENLGM nMGKMYEVALTVEGYQSSGSAnvysntlringnpls 

NFX HHFDAWARAGM nLGSHDYQIMATEGYQSSGSStvs iseggnpgnp 

PVX CHFDAWASAGLrwTGDHYYQIVATEGYFSSGYAritvadvg 

TLX CHFDAWARAGLnvHGDHYYQIVATEGYFSSGYAritvadvg 

THX N H FN AWA S HGL tLGTMDYQIVAVEGYFSSGSAsitvs 

TRXIX NHFNAWAQQGL tLGTMDYQIVAVEGYFSSGSAsitvs 

CTX HHFNAWRQAGL qLGSHDYQIVATEGYYSSGSAtvnvg 



Fig. 3, Sequence alignment of family 1 1 xylanases. Structurally very similar residues are in capital letters. The coloring (red, a-helix; green, 3jo-type 
helix; blue, p-strand) depicts the secondary structure elements. 
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• TRX I 
ANX 
AKX 

8 A X 
DTX 
BCX 
NPX 
PVX 
TLX 

• CTX 

' TRX H 
THX 



Fig. 4. Phylogcnetic tree of family 1 1 xyl aliases. 
Lengths of branches indicate the evolutionary 
distances. 



100 
90 
80 
70 

^ 60 

? 50 

g 40 
30 
20 
10 



0 0,2 0,4 0.6 0,8 1 1,2 1,4 1,6 

rmsd (A) 

Fig. 5. The plot of sequence identity as a function of rmsd value for 
family II xylanases. 

compared to both mesophilic and other thermophilic 
xylanases. Avoidance of Gly probably increases the rigidity 
of the loop regions. However, there is no general trend 
toward decreased frequency of Gly among thermophilic 
family 1 1 xylanases. Pro does not seem to play any general 
role in the thermostabilization of these enzymes. 

Thermophilic xylanases have substantially less Val 
(Tables 4 and 5), Although Val has a good P-forming 
propensity, still its frequency is lower in the p-strands of the 
thermophilic xylanases (Table 5), indicating the increase in 
the (3-forming propensity is not of primary importance in 
xylanases if some other property is more critical for 
thermostability. In addition, thermophilic xylanases contain 
more amino acid residues in the solved crystal structures 
than mesophilic xylanases (Table 4). The higher frequency 
of charged residues is involved in increasing the number of 
polar interactions. 

Secondary structures 

Facchiano et al. [41] observed that 69% of the a-helices of 
thermophilic proteins are more stable than their mesophilic 
counterparts. The stabilizing factor was the intrinsic helical 
propensity of amino acids. Lack of p-branched residues 
(Val, Thr, lie) correlated significantly with thermostability. 



In the case of xylanases, there is only one a-helix in the 
structure. The a-helix of thermophilic xylanases showed a 
higher frequency of Asp and Arg (Table 5). In NFX, the 
additional Arg 160 is located on the protein surface, and 
Asp 156 makes a double salt bridge with Arg58 (Oyl and 
Oy2 atoms of aspartic acid and Nrjl and Nr|2 atoms of 
arginine). In CTX, Argl61 makes a salt bridge with Asp57. 
Hence, the a-helix region of these two enzymes is likely to be 
stabilized by additional interactions with the loop before the 
P-strand A5. DTX, NFX and TLX have both Asp and Arg 
in the a-helix and the residues are located at positions 
(i, i + 3) or (i, i + 4), which is believed to be stabilizing 
[42], In addition, CTX has Met and Phe side chains at 
positions (i, i + 4), also thought to be stabilizing [43]. In 
NFX, TLX and CTX, the positively charged Arg is located 
at the C-terminal end of the a-helix, suggesting that it 
stabilizes the helix dipole [44]. 

The total number of positions with a p-strand structure 
was higher in thermophilic xylanases (Table 5). The 
thermophilic xylanases had, on average, 123 residues 
(range 121-128) in the P-strands and the corresponding 
number in the mesophilic xylanases was 1 14 residues 
(range 106-118). This result indicates that longer p-strand 
rigidify the protein and, thus, make it more thermostable. 
Alkalophilic BAX had as many as 131 residues in its 
P-strands, which indicates that the overall stability of the 
p-strands may be important for the alkalitolerance of 
family 1 1 xylanases. All thermophiles and BAX have an 
additional P-strand Bl at the N-terminus, which could 
have a stabilizing effect. However, mesophilic TRX II and 
THX also have this additional P-strand, The highly 
thermostable DTX has a clearly longer p-strand B3 and 
C-terminal p-strand A4, which most likely stabilize the 
structure. The C-terminal p-strand A4 gives additional 
hydrogen bonding with P-strand A5 and the extension of 
P-strand B3 interacts with a P-strand B4. BAX also has a 
longer C-terminal p-strand A4 and a short additional 
p-strand after that. 

When the three-dimensional structures of all xylanases 
are superimposed, a striking feature, in addition to the 
lengths of the terminals, is that thermostable DTX and 
alkalophilic BAX have a long insertion between p-strands 
B3 and A5. According to McCarthy et al. [15], the loop 
between B3 and A5 combined with extended C-terminus of 
DTX gives additional hydrogen bonding and hydrophobic 
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Table 4. Total amino acid composition. The largest di (Terences between thermophiles and mesophiles are in bold. 





NFX 


CTX 


DTX 


TLX 


PVX 


i hprmn 

1 1 1 \* I III V' 


TRX II 


BCX 


THX 


TRX 1 


AKX 


ANX 




BAX 


% Ala 


4.2 


5.2 


4.5 


6.7 


4.6 


5.1 


3.7 


4.8 


4.7 


5.1 


7.6 


8.2 


5.7 


3.9 


% Val 


4.2 


6.8 


5.0 


6.7 


6.2 


5.8 


7.4 


7.5 


6.8 


10.1 


8.2 


8.7 


S.l 


6.8 


% Leu 


2.6 


3.1 


5.0 


4.1 


3.6 


3.7 


2.6 


2.1 


2.6 


3.4 


2.2 


2.2 


2.5 


4.3 


% He 


4.2 


3.1 


5.5 


3.6 


3.1 


3.9 


4.7 


3.2 


5.3 


3.4 


2.7 


2.7 


3.7 


5.8 


% Fro 


2.6 


2.6 


2.5 


3.1 


3.1 


2.8 


3.7 


3.2 


3.2 


3.4 


1.6 


1.6 


2.8 


3.4 


% Met 


1.6 


1.0 


0.5 


0.0 


0.0 


0.6 


0.5 


1.1 


0.5 


1.1 


1.1 


0.5 


0.8 


2.4 


% Phe 


3.7 


2.6 


3.5 


2.6 


2.6 


3.0 


4.2 


2.1 


3.7 


3.4 


4.9 


4.9 


3.9 


3.4 


% Trp 


4.2 


3.7 


4.0 


4.1 


4.1 


4.0 


3.2 


5.9 


3.2 


3.4 


2.7 


2.7 


3.5 


3.4 


% Gly 


13.6 


14.1 


9.5 


14.9 


16.0 


13.6 


14.2 


13.4 


14.2 


12.4 


10.3 


10.3 


12.5 


11.6 


% Ser 


10.5 


8.9 


10.1 


6.7 


11.3 


9.5 


11.6 


9.6 


12.6 


12.9 


15.8 


15.2 


13.0 


8.2 


% Thr 


15.2 


12.6 


14.1 


9.3 


10.8 


12.4 


8.4 


13.4 


8.9 


10.1 


10.9 


10.9 


10.4 


7.7 


% Cys 


0.0 


0.0 


1.5 


1.0 


1.0 


0.7 


0.0 


0.0 


0.0 


0.0 


1.1 


1.1 


0.4 


0.5 


% Tyr 


8.4 


9.4 


7.0 


8.8 


8.8 


8.5 


8.9 


8.0 


9.5 


5.6 


9.2 


9.2 


8.4 


6.3 


% Asn 


5.8 


8.4 


8.5 


6.2 


7.2 


7.2 


10.5 


9.6 


1 0.0 


10.1 


6.5 


6.5 


8.9 


10.6 


% Gin 


4.2 


4.2 


5.5 


4.1 


3.6 


4.3 


5.3 


2.7 


3.2 


6.2 


3.3 


2.7 


3.9 


4.3 


% Asp 


3.7 


3.1 


3.5 


6.2 


5.2 


4.3 


2.1 


3.7 


2.1 


2.8 


4.9 


4.9 


3.4 


3.9 


% Glu 


3.1 


2.6 


2.0 


4.1 


2.6 


2.9 


2.1 


1.1 


2.1 


2.8 


4.3 


4.3 


2.8 


3.4 


% Lys 


1.6 


1.6 


2.0 


1.5 


1.0 


1.5 


2.1 


2.7 


2.1 


0.6 


0.0 


0.0 


1.2 


4.3 


% Arg 


5.2 


5.2 


5.0 


4.1 


3.1 


4.5 


3.2 


3.7 


3.2 


1.7 


1.6 


1.6 


2.5 


3.9 


% Wis 


1.6 


1.6 


0.5 


2.1 


2.1 


1.6 


1.6 


I.I 


2.1 


1.7 


I.I 


1.6 


1.5 


1.9 


Total number 


191 


191 


199 


194 


194 


194 


190 


187 


190 


178 


184 


184 


186 


207 


% Non-polar 


27.7 


27.5 


30.6 


30.9 


27.3 


28.8 


30 


31 


30 


33.1 


31 


3L5 


31.1 


33.3 


% Polar 


58.2 


58.7 


56.3 


51 


58.8 


56.6 


58.9 


56.7 


58.4 


57.3 


57.1 


56 


57.4 


49.3 


% Charged 


14.1 


13.8 


13.1 


18.1 


13.9 


14.6 


11.1 


12.3 


11.6 


9.6 


11.9 


12.5 


11.5 


17.4 



Tabic 5. Amino acid composition in a-helices and p-strands. The largest 
differences between thermophiles and mesophiles are in bold. 





ot-Helices 




p-Strands 




Thermophiles 


Mesophiles 


Thermophiles 


Mesophiles 


Ala 


2.0 


2.0 


5.4 


5.3 


Val 


0.2 


0.7 


9.8 


14.2 


Leu 






3.8 


2.5 


lie 


0.2 




6.2 


4.8 


Pro 






1.4 


1.3 


Met 


0.2 




0.8 


1.3 


Phe 


1 


1.3 


4.4 


5.0 


Trp 


1 


1 


6.4 


5.0 


Gly 


0.6 




11.0 


9.8 


Ser 


0.2 


0.5 


12.0 


13.0 


Thr 


0.8 


0.5 


16.8 


12.8 


Cys 


0.4 




1.0 


0.3 


Tyr 






15.6 


14.3 


Asn 


0.6 


2.0 


6.0 


8.0 


Gin 


0.2 


0.3 


5.4 


4.5 


Asp 


0.8 




3.0 


3.0 


Glu 






4.8 


4.3 


Lys 




0.2 


2.0 


1.3 


Arg 


0.8 




5.0 


3.0 


His 


1 


1 


1.8 


0.2 


Total 


10 


9.5 


122.6 


114.2 



has a short helix instead of 3i 0 -type helix, but there is no 
extension of p-strand B3. 

Furthermore, the structures of cord, thumb and loop 
regions vary among family 11 xylanases. It has been shown 
that these areas are flexible both in crystals and in the 
molecular dynamics simulations [45]. Some of the differ- 
ences are evident in the ligand binding [35]. Loops are 
typically the regions with the largest temperature factors, 
indicating that they might unfold first during thermal 
denaturation [46]. However, the overall temperature factors 
of mesophilic and thermophilic xylanases were not com- 
parable because the data sets of different xylanases have 
been collected at 120 K or at room temperature. Tempera- 
ture factors are dependent, in addition to the resolution and 
the programs used in the refinements, on the temperature of 
crystal during data collection. 

Several xylanases have short insertions or deletions in the 
loops (Fig. 3). Mesophilic BCX has a short insertion 
between P-strands B7 and A 6 and mesophilic ANX and 
AICX have short insertions between p-strands A3 and B3, 
but there is no clear trend that shortened loops would be 
associated with thermostability. The loop between p-strands 
B2 and A2 has an interesting feature that could play a role 
in thermostability. The thermophilic NFX has Pro in this 
loop, which increases the rigidity and this might have a 
stabilizing effect. The other highly thermostable xylanase, 
DTX, has a deletion in this loop. 



packing. They suggest that these factors may account for the 
enhanced thermal stability. In fact, the loop of DTX 
includes regular secondary structures: the extension of 
P-strand B3 and 3| 0 -type helix. Structurally similar BAX 



Disulfide bridges 

Thermophilic PVX and TLX have a disulfide bridge that 
connects the C-terminus of the p-strand B9 with the 
N-terminus of the oc-helix. According to the experimental 
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data, introducing disulfide bridges via site-directed muta- 
genesis has increased the thermostability in T. reesei and 
B. circukms xylanases. Disulfide bridges at the N-terminus 
or in the a-helix region improve the thermostability by 
10-15 °C [20,46-48]. However, the disulfide bridge alone 
cannot be crucial for the enhanced thermal stability of 
xylanases due to the fact that highly thermostable DTX, 
NFX and CTX do not contain disulfide bonds. In DTX, 
two of the cysteines are close enough to form a disulfide 
bridge between P-strands B5 and B4 in the catalytic area, 
but are not reported to do that according to the electron 
density map [15]. In addition, the mesophilic AKX and 
ANX have a disulfide bond between the cord and the 
p-strand B8, indicating that stabilization of the a-helix 
region as well as other weak areas like N-terminus by 
various strategies is more important for the thermostability 
than the disulfide bridge alone. 

Salt bridges and hydrogen bonds 

There are increasing data that indicate a role for hydrogen 
bonds and salt bridges in protein stabilization [37,38]. In 
family 1 1 xylanases, the number of salt bridges varies 
between 2 and 12 (Table 6). There is one completely 
conserved salt bridge between the C-terminal Glu (or Asp) 
of p-strand B6 (BAX and BCX have Asp) and Arg of the 
loop between p-strands B7 and A6. Thermophiles tend to 
have more salt bridges, but on the other hand mesophilic 
TRX II has as many as eight salt bridges. Alkalophilic BAX 
has the largest number of salt bridges, while acidophilic 
xylanases have the lowest numbers. Apparently, there could 
be a correlation between alkalitolerance and salt bridges. 

Thermophilic xylanases have, on average, slightly more 
hydrogen bonds than mesophilic xylanases, except the total 
number of hydrogen bonds in thermophilic CTX is lower 
than that of mesophilic TRX IT (Table 6). Thermophiles, 
especially NFX, have a large number of side chain-side 
chain interactions. 

Packing 

It has been proposed that thermophilic proteins have a 
tighter internal packing with smaller and less numerous 
cavities than mesophilic proteins [49,50]. To study packing, 
we calculated the protein density and the void volume 
values for all family 1 1 xylanases (Table 7). Because 
thermophilic xylanases have more atoms, the void volumes 
were divided by the total number of atoms to normalize 
them. Both the protein density and void volume values for 
thermophilic and mesophilic xylanases were similar. Only 
highly thermophilic DTX and alkalophilic BAX have 
slightly higher protein density and lower void volumes 
indicating better packing. 

In the comparison of PDB structures, Karshikoff and 
Ladenstein [51] have observed that proteins from thermo- 
philic and mesophilic organisms essentially do not differ in 
packing. They suggest that neither the reduction in packing 
density nor the reduction of the packing defects can be 
considered as a common mechanism for increasing thermal 
stability. On the other hand, Chen et al. [52] observed in the 
mutagenesis study that the stabilizing mutations in Sta- 
phylococcal nuclease resulted in improved packing, with the 



volume of the mutant protein's hydrophobic cores decreas- 
ing as protein stability increased. Apparently, a few protein 
families or some members in them may use better packing to 
improve the thermostability. Our study indicated that 
highly thermostable DTX may benefit from the better 
packing. Adaptation to alkaline pH might also benefit from 
better packing. 

Hydrophobicity and surface characteristics 

Because protein cores are typically hydrophobic, increased 
packing efficiency is often correlated with increased hydro- 
phobicity. Tighter packing can be achieved through the 
formation of hydrophobic clusters and enhanced van der 
Waals interactions. Increased hydrophobicity is usually 
involved in decreased accessible surface areas and a higher 
percentage of buried atoms [53]. According to our calcula- 
tions (data not shown), thermophilic xylanases have slightly 
more apolar interactions on average than mesophilic 
xylanases, but if the number of interactions are divided 
with the number of residues, the trend is not as clear 
anymore. 

As thermophilic xylanases contains more amino acid 
residues than mesophilic xylanases, they also have larger 
accessible surface areas (Table 7). So far, all family 1 1 
xylanases are reported to be monomers, therefore the 
solvent accessible areas are not buried by oligomerization. 
When accessible surface area is counted per atom, it appears 
that DTX and BAX may benefit from increased hydro- 
phobicity as well as better packing. In addition, these two 
xylanases have on average longer side-chains (atoms per 
residue) than the other family 11 xylanases studied 
(Table 7). 

One type of hydrophobic interaction is the closely packed 
aromatic ring-ring interaction, which has been calculated 
to have nonbonded potential energies between 1 and 
2 kcal moP 1 [54]. Additional aromatic-aromatic inter- 
actions are believed to contribute to the increased stability 
[55]. Bacillus D3 xylanases, which belong to family 1 1 (no 
PDB coordinates available), have eight additional surface 
aromatic residues which are believed to form 'sticky patches' 
on the protein surface that may lead to protein aggregation 
[16]. In addition, introduction of a single tyrosine into 
the N- terminal region has been reported to improve the 
thermostability and thennophilicity of Streptomyces xyla- 
nase considerably [56]. However, the studied family 1 1 
xylanases did not show any general trend toward increased 
proportion of aromatic residues (Table 4). 

It is thought that increased fractional polar surface, 
which results in added hydrogen bonding to water, 
contributes to the greater stability [37]. Table 7 shows the 
solvent accessible areas and the fractions of polar and 
nonpolar surface areas. Thermophilic xylanases have 
somewhat larger fractional polar surfaces, especially CTX 
and DTX. This indicates that polar interactions on the 
protein surface are important for the stabilization of family 
1 1 xylanases. 

Conclusions 

It appears from the analysis of three-dimensional structures 
and sequence properties of family 1 1 xylanases that there 
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Table 7. Structure statistics. Protein density = VdW(0 A)/VdW (1.4 A); void volume = VdW (1.4 A) - VdW (0 A). The largest differences 
between thcrmophilcs and mcsophilcs arc in bold. 

NFX CTX DTX TLX PVX thcrmo TRX II BCX THX TRX I AKX ANX mcso BAX 

Total number 1506 1495 1586 1512 1493 1518 1485 1448 1471 1348 1394 
of atoms 

Atoms/residue 7.88 7.83 7.97 7.79 7.70 7.83 7.82 7.83 7.74 7.57 7.66 

ASA (A 2 ) 8170 8250 8260 8150 8140 8190 7960 7810 7800 7490 7580 

8140 8340 7900 

ASA/atoms 5.43 5.52 5.21 5.39 5.45 5.40 5.36 5.39 5.30 5.56 5.44 

5.44 5.26 5.32 

Non-polar (%) 50.4 48.0 47.2 52.0 50.9 49.7 51.3 51.8 53.3 50.5 50.4 
48.0 47.6 51.6 

Polar (%) 49.6 52.0 52.8 48.0 49. 1 50.3 48.7 48.2 46.7 49.5 49.6 

52.0 53.4 48.4 

Protein density 0.550 0.545 0.556 0.544 0.541 0.547 0.549 0.547 0.549 0.543 0.548 

0.548 0.555 0.550 0.544 0.559 

0.545 
0.544 

Void 10.48 10.69 10.30 10.69 10.85 10.59 10.51 10.61 10.50 10.81 10.54 10.71 10.62 10.26 

volume/atoms 10.57 10.30 10.45 10.70 10.16 

10.68 
10.70 



1388 


1422 


1664 


7.67 


7.73 


8.04 


7600 


7700 


8560 


7600 




8490 


7600 






7600 






5.48 


5.42 


5.14 


5.48 




5.10 


5.48 






5.48 






50.4 


51.3 


48.8 


50.4 




49.8 


50.4 






50.4 






49.6 


48.7 


51.2 


49.6 




50.2 


49.6 






49.6 






0.544 


0.547 


0.556 



are several minor modifications that correlate with the 
increased thermostability. Increase in the frequency of Arg 
is a known determinant in the thermostability, in the same 
way as the increase in the Thr : Ser ratio. The bulky side 
chain of Arg offers a possibility for several hydrogen bonds 
and involvement in salt bridges. A general stabilizing 
principle appears to be an improved network of inter- 
actions, which is reflected in the increased frequency of total 
number of atoms, charged amino acids and hydrogen 
bonds. Another feature appears to be the increase in the 
number of residues in the p-strands. This indicates that the 
two-layered p-sheel has an important role in the stability of 
family 1 1 xylanases. However, the frequencies of amino 
acids with high p-forming propensity may either decrease 
(less Val) or increase (higher Thr : Ser ratio). The a-helix 
region of thermophilic xylanases is stabilized by various 
strategies including additional hydrogen bonding, salt 
bridges or disulfide bridges. It is evident that all these 
changes increase the protein rigidity, which is a property 
usually associated with enhanced thermostability. 

We found also some features that can explain the 
increased thermostability in specific cases. The highly 
thermophilic NFX had a great number of side chain-side 
chain polar interactions and several salt bridges. There 
could be a trend in xylanase structures that acidophilic 
xylanases have only few and atkalophilic BAX several salt 
bridges. It is possible that alkaline and thermal adaptation 
use partly the same mechanisms for improving the stability. 
Better packing of the thermostable DTX and the alkalo- 



philic BAX is likely to improve thermostability. Thermo- 
philic CTX and DTX have an increased fractional polar 
surface, which creates more hydrogen bonds with water. 

Several experimental studies have been published on the 
stability of family 1 1 xylanases. Two major regions affecting 
the thermostability are the protein N-terminus and the 
a-helix region [19,20,47,48,56], Thermophilic xylanases 
appear to have a more stable a-helix than their mesophilic 
counterparts. The mechanism of stabilization at the 
N-terminus is not so obvious. All thermophilic xylanases 
have an additional p-strand Bl at the N-terminus, but also 
mesophilic TRX II and THX contain this P-strand. The 
N-terminus of NFX is a couple of amino acids longer than 
that of TRX II, but there is no clear reason why the longer 
N-terminal might increase the stability. However, the 
extension of the N-terminus of TRX II has been reported 
to increase the thermostability [57]. The third region with 
effect on the thermotolerance seems to be the Ser/Thr 
surface. The introduction of five arginines into this surface 
increased the apparent temperature optimum by % 5 °C 
[39]. Thermophilic CTX has two additional arginines, 
Arg27 and Arg68, on that surface. Although there are 
more arginines in thermophilic xylanases, the presence of 
arginines on Ser/Thr surface does not seem to be a widely 
used strategy among family 1 1 xylanases. In conclusion, a 
number of minor modifications appear to explain the higher 
stability of thermophilic family 1 1 xylanases and many 
thermophilic xylanases have unique features that may 
increase their stability. 
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FIG. 1. Mapping of the functional domain of B, circulans xyla- 
nase gene cloned in pBCX549-R3. The 4.3-kb B. circulans DNA 
insert was subjected to subcloning at various restriction sites (B, 
BamHl; H 2 , Hindi; H 3 , tfmdlll; P, Pstl; R, EcoRI). Deleted areas 
are indicated with dotted lines. Xylanase expression levels obtained 
with the various subclones are given on the right. Clearing zones 
extending 1, 2, 3, and 4 mm from colonics are designated +, + + , 
+ + + , and + + , respectively. The location and extent of the 
xylanase gene functional domain are indicated by the shaded box. 
The solid box indicates the location of enhancer sequence, and the 
arrow indicates the direction and location of the LacZ' gene 
promoter. All subclones carried pUCl9 as the vector with the 
exceptions of pBCX632(L) and pBCX632(S), which carried pTZ18 
and pBCX673 which carried pUC9. NT, Not tested. 



macia). Protein bands were visualized by Coomassie brilliant 
blue staining. 

RESULTS 

Mapping of (he xylanase gene functional domain in 
pBCX549-R3. The functional domain of the xylanase gene 
was located by subcloning of the 4.3-kilobase (kb) B. circu- 
lans DNA insert. The subcloning strategy and the results of 
the xylanase assays are summarized in Fig. 1. A 1,0-kb 
//i/idlll-f/mcll subfragment was found to code for xylanase 
activity. After 24 h of growth, virtually all of the activity 
produced by the various subclones was located in the 
cytoplasmic fraction- However, the amount of activity pro- 
duced by the different clones varied greatly. Maximum 
levels were observed with pBCX600, a subclone in which the 
0.3-kb upstream sequence was retained and the noncoding 
downstream sequence was deleted. 

Expression levels. The influence of the upstream sequence 
on expression level is demonstrated by the differences in 
xylanase production by the two primary clones (pBCX549- 
Rl and pBCX549-R3) and by the effect of elimination of the 
sequence in some of the subclones. The difference in expres- 
sion level between pBCX600 and pBCX632(S) is particularly 
striking. It was observed that xylanase overproduction by 
the pBCX600 clone was accompanied by a change in cellular 
morphology. Many of the cells were up to 10 times longer 
than normal E. coli cells. Deletion of the sequence down- 
stream from the coding region resulted in a significant 
increase in expression level. The different xylanase levels 
produced by pBCX632(L) and P BCX632(S) suggest that the 
orientation of the gene (31) with respect to the p-galactosi- 
dase promoter is of some importance. 

The effects of the upstream sequence and the p-galaclosi- 
dase promoter orientation were further investigated by con- 
struction of a subclone (pBCX839-3) from pBCX600 in 
which the orientation of the //i/icll-//mdlll fragment was 
reversed (data not shown). This was accompanied by a 50% 
reduction in expression level. Clones pBCX839-3 and 
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FIG. 2. TSK CM chromatography of intracellular xylanase pro- 
duced by E. coli harboring pBCX549-R3. Symbols: , A^; — , 

NaCI; O, xylanase. 

pBCX632(L) are identical with respect to (3-galactosidase 
promoter and xylanase gene orientation, yet the expression 
level for pBCX839-3 was seven times that of pBCX632(L). 
This indicates that the EcoJU-HindJU fragment also en- 
hances expression level when positioned downstream of the 
xylanase gene. 

Subclone pBCX673, which was constructed by insertion 
of the 2.2-kb Pstl-BamUl fragment of pBCX549-R3 into the 
corresponding sites of pUC9, was halo positive (Fig. 1), 
while a recombinant constructed in the same manner with 
pUC19 as the vector was halo negative. The nucleotide 
sequence near the Pstl site of pBCX673 indicated that the 
first 10 codons of the p-galactosidase gene contained in 
pUC9 had been inserted immediately upstream of the mature 
xylanase gene and in the same reading frame: 5'... . ATG 
ACC ATG ATT ACG CCA AGC TTG G£I GCA QCT 
AGC. . . 3' {Pstl site underlined). In effect, the 10-residue 
N-terminal peptide encoded by the (3-galactosidase gene 
replaced the 28-residue prepeptide of the wild-type xylanase. 

Levels of xylanase production by E. colt harboring 
pBCX600 in 2YT medium and 2YT supplemented with 
larchwood or oat-spelt xylan were similar. Addition of 
isopropyl-P-D-thiogalactopyranoside was also without ef- 
fect. Addition of glucose or xylose to the medium resulted in 
much-reduced enzyme levels. For this reason, the organism 
was grown in unsupplemented 2YT for xylanase production. 

Southern hybridization analyses. Brief autoradiography 
showed that EcoRl digests of B, circulans and B. subtilis 
genomic DNA contained a common 4.0-kb band that hybrid- 
ized with pBCX549-R3 (data not shown). The existence of a 
common 22-kDa xylanase gene in these organisms was 
further confirmed by hybridization between pBCX549-R3 
and BamHl digests of genomic DNA. Digests of B. circulans 
and B. subtilis DNA contained 9- and 15-kb bands, respec- 
tively, that hybridized strongly with pBCX549-R3. 

Xylanase purification. After 24 h of growth, more than 95% 
of the activity was located in the intracellular fraction. 
Fractionation of the intracellular material on a TSK CM 
column separated the xylanase activity, which adsorbed to 
the column, from most of the other UV-absorbing activity 
(Fig. 2). Sodium dodecyl sulfate-polyacrylamide gel electro- 
phoresis of the xylanase fraction revealed the presence of 
small amounts of impurities. The xylanase fraction obtained 
by ion exchange chromatography was further purified by 
permeation chromatography on a TSK G2000 column. The 
activity was associated with the larger of two protein peaks 
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FIG. 3. Sodium dodecyl sulfate-polyacrylamide gel electropho- 
resis or intracellular fraction of E. coli harboring pUCl9 (A), 
intracellular fraction of £. coli harboring pBCX549-R3 (B), purified 
xylanase (C), and molecular weight markers (D). Numbers on the 
right are molecular weights (10*) of the markers. Approximately 20 
p.g of intracellular protein and 2 u.g of purified xylanase were applied 
to the gel. 



and eluted late in the run (data not shown). The enhanced 
expression of the enzyme in £. colt harboring pBCX600 is 
evident from the size of the xylanase peak in the TSK CM 
column chromatogram (Fig. 2) and the intensity of the 
xylanase band in the intracellular fraction of cells harboring 
this plasmid (Fig. 3). 

Xylanase characteristics. The active fraction obtained by 
the purification procedure was shown to contain a single 
band with a molecular weight of approximately 22,000 (Fig. 
3). On the basis of sequence data (31), the actual molecular 
weight of the protein is known to be 20,500. The protein 
eluted from the gel permeation column at a position much 
later than would be expected for a molecule of this size. The 
enzyme had an adsorption maximum at 280 nm and a molar 
extinction coefficient of 53,000. It apparently bound 
Coomassie blue poorly, since the Bradford protein assay 
gave a value that was 1/10 the actual amount, as determined 
from the weight of lyophilized pure enzyme. 

DISCUSSION 

It was previously reported (31) that the coding sequence of 
the pBCX600 insert was virtually identical to that described 
earlier (18) for the B. subtilis gene which codes for 22-kDa 
xylanase. The expression level data reported in the present 
article point to the significance of the 0.3-kb upstream 
fragment, since its deletion resulted in a drastic drop in 
intracellular xylanase levels. The nucleotide sequence of this 
fragment has been determined and has been shown to 
contain a modified promoter sequence and two pairs of 
tandem repeats resembling some animal virus enhancer 
elements (1, 31, 32). The EcoRhHindUl fragment described 
in this report behaves like eucnryotic transcriptional en- 
hancer elements in that expression levels are influenced by 
the location of the fragment in positions both upstream and 
downstream of the xylanase gene. The reason for the in- 
creased expression that was observed upon deletion of the 
sequence downstream from the coding region is unknown. It 
could be the result of increased copy number of the smaller 
plasmid and enhanced translation of the xylanase gene in the 
smaller plasmid. There is a (N) . . . -Ala-Ala-Ser- . . . (C) 
juncture sequence between the prepeptide and mature pep- 
tide (31). This sequence is also found in the pBCX673 
subclone, in the genes encoding several extracellular B. 
subtilis proteins (14), and in other xylanases (9, IB). The 



fragment also contains a Shine-Dalgarno sequence (24) or 
putative ribosome-binding site. The 3'-flanking sequence of 
the insert contains a pair of inverted repeats which have 
been proposed as transcription termination signals (21, 31). 

The upstream sequence contained in the clone described 
here is considerably longer than that reported for other 
clones of homologous genes from other bacilli. A ribosome- 
binding site and possible promoter sequences were identified 
in a 60-base-pair (bp) sequence upstream of the initiation 
codon in a clone of the B. pumitus gene (9). A clone of the B. 
subtilis gene contained a 115-bp sequence upstream from the 
initiation codon (18). Possible promoter sequences were also 
identified in this sequence. The upstream sequence in the B. 
circulans clone described here contains 391 bp. A putative 
promoter sequence is located at 50 to 57 bp upstream of the 
initiation codon. The two pairs of tandem repeats that are 
thought to act as enhancer elements (31) are located at 256 to 
269 bp and 286 to 302 bp upstream und are not present in the 
B. pumilus or B. subtilis clones. Very high expression levels 
in E. coli have been reported for two xylanase genes from 
two strict anaerobes, Bacteroides succinogenes (25) and 
Clostridium acetobtttylicum (33). The C. acetobutylkum 
enzyme has a molecular weight of approximately 28,000 (33). 
The Bacteroides succinogenes enzyme has not been charac- 
terized. Regulation of gene expression was not studied in 
either instance. 

Commercial application of cloned endoglycanase activi- 
ties such as use of cellulases and xylanases in the processing 
of lignocellulosic activities is hampered by low enzyme 
yields. However, there have now been several reports of 
high expression levels of these enzymes in £. coli. O'Neill et 
al. (17) obtained overexpression of a Cellulomonas fimi 
cellulase gene in E. coli by placing the gene under the control 
of the leftward promoter of the bacteriophage lambda. 
Enhanced gene expression was obtained by thermal inacti- 
vation of the heat-sensitive lambda cI857 repressor. The 
same strategy has been used to obtain high levels of Clos- 
tridium thermocellum cellulase expression in E. coli (22). C. 
thermocellum cellulase overproduction in E. coli has also 
been reported by JolifT et al. (12). As with the cloned B. 
circulans xylanase gene described here, overexpression was 
fortuitous. Since the C. thermocellum gene was not se- 
quenced, no attempt was made to explain the molecular 
basis of overproduction. 
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Expression of a Trichoderma reesei 3-Xylanase Gene 
(XYN2) in Saccharomyces cerevisiae 
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TheXYNl gene encoding (tie main Trichoderma reesei QM 6a endo-|M,4-xylaiiase was amplified by I'CR from 
first-stmnd cDNA synthesized on mRNA isolated from the fungus. The nucleotide sequence of the cDNA 
fragment was verified to contain a 699-bp open reading frame thot encodes u 223-amino-acid propeptide. The 
XYN2 gene, located on WfcU-based multicopy shuttle vectors, was successfully expressed in the yeast Saccha- 
romyces cerevisiae under the control of the alcohol dehydrogenase II iADH2) and phosphogly cerate kinase 
{PGK1) gene promoters and terminators, respectively. The 33-amino-acid leader peptide of the Xyn2 (J-xyla- 
nose was recognized and cleaved at the KexMike Lys-Arg residues, enabling the efficient secretion and 
glycasylatton of the heterologous P-xylnnase. The molecular mass of the recombinant 0-xylanase was estimated 
by sodium dadecyl sujfote-polyacrylamide gel electrophoresis to be 27 kDa. The construction of furl ura3 S. 
cerevisiae strains allowed for the autoselection of the VBA J-based XYN2 shuttle vectors In nonselective complex 
medium; These autoselectlve 5. cerevisiae strains produced 1,200 and 160 nkat of p-xylonase activity per ml 
under the control of the ADH2 and PGKJ promoters in rich medium, respectively. The recombinant enzyme 
showed highest activity at pH 6 and 60°C and retained more than 90% of its activity after 60 min at S0°C. 



Xylan is a major component of the cell walls of monocots 
and hardwoods, representing up to. 35% of the dry weight of 
these plants (31). This polymer is second only 10 cellulose in 
natural abundance and represents a major reserve of fixed 
carbon in the environment. Unlike cellulose, .\ylan is a complex 
polymer consisting of a 0-D-l,4-linked xylopyranoside back- 
bone substituted with acetyl, arabinosyl, and glucuronosyl side 
chains. Hydrolysis of the xylan backbone is catalyzed by endn- 
p-l,4-xylanase$ (EC 3.2.1.8) and p-D-xylosidases (EC 3.2.1.37) 
(5). Endo-3-xyInnascs act on xylans and xylo-oligosaccharides, 
producing mainly mixtures of xylo-oligosaccharides. p-D-Xylo^ 
sidases hydrolyze xylo-oligosaccharides to o-xylose (5). Many 
bacterial and fungal species arc able to utilize xyiuns as u. 
carbon source. Interest in the enzymology of xylan hydrolysis 
has recently increased because of the application of P-xyla- 
nases in biobleaching (27 T 42) and in the food (23) and animal 
feed (45) industries. Several microbial sources have been in- 
vestigated for P-xylanase production, and strains of the fungus 
Trichoderma have been shown to secrete large amounts of 
efficient xylan-degrading enrymes (48). 

The yeast Saccharomyces cerevisiae can neither utilize nor 
degrade xylan (18), but it possesses a number of attributes that 
render it an attractive host for the expression and production 
of p-xylanases (32). It is a unicellular fungus which provides for 
posltranslational processing such as endoproteolytic cleavage 
and giycosylation. S. cerevisiae normally secretes few proteins 
in low abundance, so that a secreted heterologous protein is 
produced separate from the majority of yeast proteins. Fur- 
thermore, it has complete GRAS (generally regarded as safe) 
Minus, which allows for its use in the food industry. 

Trivhoilvrma reesei is a filamentous mesuphilic fungus that is 
well known for its ccllulolytic and xylanolytic enzymatic activ- 
ities (29). The two major inducible cndo-xylnnases secreted by 
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this fungus are Xynl and Xyn2 (44). They are both relatively 
snail protein molecules with molecular masses of 19 and 21 
kDa, respectively. The isoelectric points of Xynl and Xyn2 are 
5.2 anil 9.0, respectively. The pH optima of Xynl (pH 3.5 to 
4.0) and Xyn2 (pH 4.5 to 5.5) differ, but the proteins produce 
similar hydrolysis end products (44). Xyn2 represents more 
than 50% of the total xylanolytic activity of 7*. reesei cultivated 
on xylan and, together with Xynl, accounts for more than 90% 
of the xyhin-ilegrading ability of this fungus (43). 

In this paper, we describe the molecular cloning of the T 
reesei XYN2 gene in 5. cerevisiae. Expression of the XYN2 gene 
in S. cerevisiae was obtained with the aid of multicopy plasmids. 
using two different S. cerevisiae promoter-terminator expres- 
sion cassettes derived from the inducible alcohol dehydroge- 
nase II (ADH2) gene (35) and the constitutive 3-phosphoglyc- 
erate kinase {PGK1) gene (15). The enhanced production of 
recombinant p-xylanasc in nonselective complex medium, 
without the risk of losing the cpisomal vector, was obtained by 
d:srupiing ihe uracil phosphoribosyllransfcrase (FURf) gene 
(!9) in the f^xylanasc-producing S. cerevisiae sirains. This step 
ensured autoselection of the l//M.?-bearing expression plas- 
mids in rich growth medium. 

MATERIALS AND METHODS 

Strains and medio. The genotype* the microbial strain* und phsmiih u«d 
in ihc present study ure summarized in Table \ .S. cwiiiae Y2U4 was cultivated 
on cither rich YPD medium (11? yeast cornet. 2<£ peptone. 2** iflucusej or 
Klccirvi.*. buffered synthetic complete (SQ medium (2fi* glucose. 0.67^ ycu-J 
nitiogen ha*c |Difco| containing amino ncid supplement 20 mM succinate |pH 
o.O]), respectively (33). T. nriri QM 6a w;t* cultivated in nasal medium {U.3CI, uiti 
spelts lylnn (Sigma). 0.4<Tc KH : P0 4 . IS- (NfUjHPO,. \ r r peptone. 0.3* yean 
eitmctj (25). Bolh the<c organisms were culiuied m I -liter fMcnmeycr flask* 
containing 100 to ZOO ml nf medium at JUX on u ruliiry shaker at 150 rpm. 

Recombinant plasm ids were constructed and amplified in Escherivhia eoti 
JM101 cultivated ut JTC in Luria-Bertuni liquid medium or un tjirin-Bcnnni 
apr (38). AtripiciMn tur select ing and propagating remiunt b;icierin *ns artrteiJ 
to a final concentration of 100 u-g/ml, 

DNA inn n I pu In I Ion s and plosmld construction*. .Standard protocol* were lul- 
lowed for DNA manipulutiuns |3«). Tin: enzymes loi DNA dcuvyge and ligation 
aid plmsphorylaied synthetic fthkcr> were purcluscd Mum Boehringer Munn- 
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TABLE I. Microbial strains and plasmids used in ihis study 



Strain or 
plasmid 



Relcxani gcnoiypc 



Source or 
reference 



Strains 
5- re/miiue Y294 
T. rttiti QM tin 
E. coli JM101 



Plasmid* 
YEpC-PADH2a 
pUC8 
P UC19 
pBLI 
pBL2 
YEp352 
pDLCl 
pJCI 
pDLG5 
pDLGA 
pPE 
pDFl 
YEplJ 



a leu 2-3. 112 uraJ-52 /itr.? rrp 1-289 
Type culture 

f»t' f^t' * U P£ thiO[tae-proAB) {¥' 
traD36 prvAB tacPZ AM 15) 



bta LEU2 ADf /2 rT 

bta tdi 

bta tct 

bta ADH2 n 

bta A0H2 r 

bta URA3 

bta URA3 AUH2 rT 

bta URAJ PCKt P j 

bta URAJ ADH2 f *XYN2-ADH2 T 

bta URA3 PGKl fXWl-PGKt r 

bta TRP1 FORI 

bta furl Y.UEU2 

bta LEVI 



This laboratory 
ATCC 13631 
38 



30 
46 
49 

This laboratory 
This laboratory 
14 

This work 
S 

This work 
Tli is work 
19 

This work 
7 



heim and mcd as recommended by the supplier Restriction endonuelease- 
digoled DNA.wat clulcd from agarose gels by the method or Benson (4).. 

The construction or plakmid pDLGl is summarized in Fig. 1A. A 1.4-kb 
EtoMSatl fragment containing the S. cemiiiae ADH2 promoter region wns- 



EXPRfSSlON OF T. REESEl .W2 IN S. CEREViSLAE 1037 

-ecovenrd from plasmid YEpC-PADH2a (30) md ligated lo plasmid pUCH. 
%hich'had been Unearned with EcuRl and Sail The unique £cuRV lite up- 
iircatn of ihc.-U>//.' promoter legion and the Hindi site (also the Sat\ site; in 
■.he pUCfl moiety were suhsequemly changed to a BamH\ and BtfU <tic. respec- 
K-cly. through the addition at synthetic linkers to create plasmid pDLI. The 
ADH2 terminator region was recovered from YEpCPADH2a as a O.H3-kb5moi- 
3ctl fragment and cloned into the Smul 'and BamHl sites of plasmid pUCB. The 
Smal site wu cleaved and changed to a DglU site through the addition of a 
lynthctic Afc/II linker lo generate plasmid pUL2. VhcADH2 promoter region was 
iubsequcntry isolated as a l.4-kb Eco R\-Ugt\\ fragment from plasmid pDLI and 
jubcloned into the EcwRI nnil flfcfll sitca of plasmid pUL2. The tallowing syn- 
thetic oligodcoxyribonucJcuttdcs were annealed to generate a polylinkcr with the 
unique restriction sites £VoRI. Bglll. and A7ioI: XBR-I (S'-CCTCGAGOAjj 
AT^GQMIICCO:) and XBR-2 ( 5 ' • G GAATJCC A 0 ATCTCfTCG A 0 G - 
3'). The BgtU site between the, ADM promoter, and icrminatbr regions w;is 
cleaved, and the protruding ends were filled in with DNA polymerase I {Klenow 
fragment). The EcoRl-BgMXhot polylinkcr was introduced in place of the BgtU 
iitc. The 2.1-kb fio/nHI-Windlll promoier-terminator fragment was isolated 
from the plasmid and cloned inlD the BamHl and Hmdlli sites of the ycast-£. 
epti shuttle vector YEp352 (14). The unique Eco Rl site in YEp352 was deaved. 
ihc protruding ends were rilled in with KJenuw fragment, and the blunt ends were 
leligatcd prior to the introduction of the ADH2 fragment io ensure that the 
tVoRI within the polylinkcr remains unique. The final ADH2 yeast egression 
rector was named pDLGl. Plasmids pDLGl and pJCI (8) Were used as cxpics- 
iton vectors for XYNZ 

Plasmid pDFl was constructed as follows. The 2.2-kb PstUEcoKL fragment 
containing the 5. cere\-tsiae FURt gene wns isolated hrim plasmid pPE (19) and 
ilgatcd to pUC 19. predigestcd with Pstl and £coRl. The FURt gene was cleaved 
at the unique Accl and Ctn\ sites, the overhanging ends were filled in with DNA 
polymerase I (Klenow fragment), and a Sail iynthetic linker was added to 
generate a W0-bp Her l-C/u I deletion within the FUR} open reading frame. The 
LEU2 gene was isolated from plasmid YEp!3 (7) as a 12-kb Satl-Xfiol fragment 
md duned into the San site created within the FURI open reading frame to 
construct n furt:±EU2 disruptive allele, the resulting integrating plasmid was 
designated pDFl. 
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FIG I Schema lie nummary of the construction of plasmid pDLG I (A) and plosmid maps of ih« A'W2-containing P'^* m, j* ^ ^ j^J.?. n ^ ^ ^ ' c ' ' 
& cli< indie: ted hy crunched .he ^elcaahle nmken { LEU2, WW J. »nd bin* ure mdicaitdby hatched boxes, xhc ADH2 and PUK prnmo.ef and tcrnunalor 

L,ucncc arc indicn.ed by upen hues, and the 2 (1 m ycaM origin of repliclion and p3R323^UC« "Minces a,e indicated by thick and I h m l,n«>. rnpcciivdy. The 
rcMrienoh Mtc, ■nd.ca.cd L toil II (Bm). ticl\ (Ucj: ^11 IHgl, E<oM (Rl). HxndLl (Hd), /fell (Hp). Sail iS). Smul fS.n). and XmA (X). 
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A 4.0-kilobase (kb) fragment of Bacillus circulans genomic DNA inserted into pUC19 and encoding 
endoxylanase activity was subjected to a series of subclonings. A 1.0-kb HindllhHincll subfragment was found 
to code for xylanase activity. Maximum expression levels were observed with a subclone that contained an 
additional 0.3-kb sequence upstream from the coding region. Enhancer sequences in the upstream region are 
thought to be responsible for these high expression levels. Southern hybridization analyses revealed that the 
cloned gene hybridized with genomic DNA from Bacillus subtilis and Bacillus polymyxa. Xylanase activity 
expressed by Escherichia coli harboring the cloned gene was located primarily in the intracellular fraction. 
Levels of up to 7 U/ml or 35 mg/liter were obtained. The protein product was purified by ion exchange and gel 
permeation chromatography. The xylanase had a molecular weight of 20,500 and an isoelectric point of 9.0. 



The structure of xylan, a major component of plant cell 
walls, is quite complex in that the (3-1,4-linkcd D-xylose 
backbone is highly substituted with acetyl, arabinosyl, and 
uronyl groups (2). The enzymatic degradation of xylan to 
monomeric material requires several distinct activities, but 
endoxylanase, as the depolymerizing enzyme, is of particu- 
lar significance. Most biochemical studies have focused on 
fungal xylanases such as those produced by Aspergillus 
niger (6-8, 16, 23), Trichoderma harziamtm (26-28), and 
Schizophylhim commune {5, 13, 19). It has been shown thai 
xylanolytic organisms produce multiple forms of xylanase 
with different physical and kinetic properties (6, 8, 23). It is 
not known how these enzymes act in concert and to what 
extent the multiple forms are the result of differentia! mRNA 
processing or posttranslational modification. 

A 20- to 22-kilodalton (kDa) xylanase has now been 
identified in xylanase preparations from both fungal and 
bacterial sources (9, 13, 18, 20, 27, 29, 31). Sequence data for 
such xylanases from Bacillus circulans. Bacillus subtilis, 
Bacillus pumilus, S. commune, and 7\ harziamtm have 
shown extensive sequence homology (Yaguchi et al., Abstr. 
Meet. Am. Chem. Soc. 1988, V195, part 1, p. 127). This 
enzyme has now been identified in several Bacillus spp. It is 
the smallest of at least two distinct xylanases produced by 
these organisms (11, 29, 30). B. pttmilus (9, 20), B. subtilis 
(18), and B. circulans (29, 31) genes encoding the 20- to 
22-kDa enzyme have been cloned and expressed in Esche- 
richia coli. On the basis of sequence data and DNA hybrid- 
ization studies, a high degree of homology has been shown to 
exist among the 20- to 22-kDa xylanases (9, 18, 31). Expres- 
sion levels, like those in the Bacillus spp. donor strains, have 
been quite low. This report details the construction of a 
clone in which the levels of expression of the B. circulans 
xylanase gene are much higher than those previously 
achieved with cloned xylanase genes or with the Bacillus 
spp. donors. 

MATERIALS AND METHODS 

Organisms and growth conditions. E. coli HB101 harboring 
recombinant plasmids was grown in 2YT broth as described 

* Corresponding author. 
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previously (29). Plasmids pBCX549-Rl, pBCX549-R3, and 
pBCX549-P5 carry B. circulans xylanase genes and were 
constructed as described previously (29). PJasmid pBPX277 
contains a Bacillus polymyxa xylanase gene (30), and 
pRH200, kindiy provided by M. Paice, harbors a B, subtilis 
xylanase gene (18). 

Subcloning of pBCX549-R3. Large preparations of plasm id 
DNA were isolated by the alkali-sodium dodecyl sulfate 
method of Birnboim and Doly (3). Small preparations were 
isolated as described by Guo et al. (10). Plasmid DNA was 
completely digested with restriction enzymes, and the result- 
ing fragments were separated by electrophoresis in 1.2% 
low-melting-point agarose gels containing TAE buffer (40 
mM Tris hydrochloride [pH 7.8], 20 mM sodium acetate, 2 
mM EDTA). Purified subfragments were isolated from the 
gels (30), ligatcd with an appropriate vector, and trans- 
formed into E. coli HB101 (30). 

Hybridizations. Southern hybridizations were carried out 
as described previously (30). EcoRl and BamRl digests of 
genomic DNA from B. circulans, B. polymxya, and B. 
subtilis were electrophoresed in a 0.8% high-melting-point 
agarose gel containing TAE buffer and were hybridized with 
a 32 P-labelcd pBCX549-R3 probe. 

Xylanase assays, Xylanase assays contained \% oat-spelt 
xylan and were carried out as described by MacKenzie el al. 
(15). One unit was defined as the amount of enzyme releasing 
1 fimol of reducing sugar per min. 

Localization and purification of xylanase. Periplasmic and 
intracellular fractions were obtained by osmotic shock and 
sonication procedures as described previously (30). Protein 
concentrations were estimated by the method described by 
Bradford (4), with pooled immunoglobulin as the standard. 
Intracellular xylanase preparations were fractionated by 
high-pressure liquid chromatography. Samples were applied 
to a TSK CM-3SW column. (7.5 by 150 mm) and eluted at a 
flow rate of 1 ml/min with a 0 to 1 M NaCI gradient in 10 mM 
phosphate buffer (pH 6). Fractions containing xylanase were 
freeze-dried, redissolved, applied to a TSK G2000SW gel 
permeation column (7.5 by 600 mm), and eluted with 50 mM 
phosphate buffer (pH 6). Fractions were analyzed by sodium 
dodecyl sulfate-polyacrylamide gel electrophoresis in 10 to 
15% polyacrylamide gradient gels in a PhastSystem (Phar- 
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RNA isolation and firsMstrand cDNA prcparaliun. One utei >>t /' wwi IJM 
bn culture prepared in o.ii <pcti; h-isal medium for- l< h al .U' C. The luruul 
mvcclia were harvested through ehrr»(v : oih and lro/en .imki 'mind mintp^it. 
The fiu/.en tmeeli.t ucic emund into a rtnc powder with a muriar and pe->!le Und 
sufpc nded in a mixture ol 15 ml '>: phenol ami Ml ml ul itc cold STE buifci I UH> 
mM NaCl. 2 ^0 in\1 Tiis-HCI (pH 7.2 j. 10 mM EDTAi. and total cellular UNA 
was isolaied at detained previously (221. The pnlyt A |.cniit.nniiH! RNA fraction 
was purified *ith ilie :tid of an Oliuuie.\-dT mRNA midi kit (Strata trend, fust- 
si rand cDNA synthesis was curried nut with M'H> ne nf polyiAJ RNA by using a 
lirM-Mraml cDNA .synthesis kit i, Bochrineer Mannheim) a> >peeilled by die 
supplier. 

PCR amplification. The 7* wwi ATA7 i*ene-w*iM ijulaied fmin a fiiM-snaud 
cDNA mu hv PCR.*iih ihc two rtliconiiclentides DLGIL (5 -GCAT(jAATir 
GCC-A/\ACCrGAAC/\ACCC..V| and DLOIR (V GCATAliAjriCrcnT 
AGLTGALGGTCA-3'1. These primer* u-erc hased an the sequence of the T 
rvesci M'S2 gene a* published by Turriinen ct al. |Jl), DNA was amplilied in 
50-uJ reaction mixtures (0,Z5 mM each primer, react inn buffer IV. I mM MiiCh. 
500 u.M each deoxynudeoside triphosphate, 2 u.1 of template DNA (i 211 ng of 
first-strand cDNA-mRNA hybrid|. 2.5 U tit" Tm\ polymerase [Advanced Biotech 
nn1n&y|) under mineral oil with a Uiumetra Trio fherrrobloek TBI (Biometra 
BinmcdirtniulK Analytik. Gotringcn, Germany) ( P. 37). Dcnaliiraliun. anneal* 
my. and putymciizutinii wcic carried out for 1 min at I min at and 
1 min a I 72 S C, respectively, for 29 eyelet Tlte amplilied DNA fragment war* 
diyesittl wiih flg/II ;wd EioR] and ligJied to pDLGl picdigesicd iviih the same 
restriction enzymes. The resulting plasmid. pDLGS (Fig. ID). **as tronsformcd 
into 5. cfrti'isiot Y2'.M bv ihc dimethyl siilfmidc- lithium acetate method de- 
scribed by Hill et al. 03).' 

Screening for P-jrjtannc ottklty. rranblurnnmts w-cre screened iat \ylan- 
degrading ability after being plated un SC" C ^ medium or SO medium (after 
disruption of the FURt gene) containing 0.2'~t uf 4-0-meihy|.n*glueurunii-n- 
vjlon-rcmaznl brilliant blue R (RBB)-.xylan (Sigma} und 2*7 galneluae as the 
carbon source (fi. 10). p-Xylunasc cleaves RBB-\ylan into a colorless pruducl. 

SubclonlnR und lequencinj; of XW2. The 0-xylanasc uenc rrom pDLG? wiw 
cloned imp pUCR. nnd four deletion subclones were constructed with apprupri- 
yic ru-biriction cniymti. The nudeniide sequence of the XYS2 gene wa* deter- 
mined by scquenctnn both strandi of ihe cloned cDNA inserts by the radioactive, 
dideoiry chain lerminatiun meihoct (.W)vvitli V D.VA ptilym erase /Pliaimaciaj-as 
specified by the manufatturcr. Tlic Nci|i»ertt.*e U^ia obtained were analyzed with 
the GENEPROsottware package (Hocler Scieniific InsirumrnK. San Fnincivtv 
Calif.). 

SiMithem and Northern hybridiiallom. Total DNA na*. Ki»l.neil (rum S tcr- 
aiitac strains ( Ui), digested with .VwT and S'cai. separaled on a 1' * ayaruie tjcl. 
und hlntted to n Hybnnd-N membrane (Amcrsham International). Soutliern 
h>bndi/iitions ucre carried nut by ihc method described hy Southern (41 1. 
modified jv dewnbed by Sambronk ct al. (3H). Northern (RNA) hybridizaunnfi 
were cut tied out as dcHtilwl bv Sambrook et al i.'UtJ.'For boih Southern and 
Northern hylindizuiiuiii. sinylc-snaiuled DNA fraument* ^ere labcllcJ w>i:h a 
random-pnmed DNA-labellinu kit ilkjchnnger Mannheitn) ioukuie (u-'*P| 
ATP as speciHed by. the manufaciurcr 

Pmifin preparation ond gtl clectropniirr tis. Pruiein piepunuion-f were per- 
tunned tHt ice. The . supernatant ol a 2t«t.ml culture >il Iht* rmimhmant $ 
t i-frt/jjuc *ir;tin >eparated from the cellular ma^s by eentntugation U>r 5 mm 
;ii l.iXK) ' e. Tlie *upernalam was tiltcicd and euneenttaicd .n a Uulbi Ultr.i- 
tiltcr PMM> eoiiccntratoi (Amicon Division ol W. R. Grace and Co.. Dative r*. 
Mu^.l und stthieifuenily precipitated with 2 vnlumes nf ice eeld acetone fur ) b. 
Tht- intruccllular prolein fruiriion was irtnacted jfier the eclls uere ivashed i^iee 
in 5 ml ol extraction hutler (ZIH) mM Tri.s (pH H ill- MM) niM iNH.J-SO,. It! mM 
MgCU. I mM liDTA. Itl'.V jjlyieiol) .uid -.wo uliimaicly suspended in n.4 ml of 
the same buller. Cells' were ruptured v^iih t.i. il 2 tu J ml ke-tohl .ieid-M;t>hed 
gla>S bcadi (0.45 u-m diameter) in a 2-ml mkiOLcnltihuii.- iul>c h) vutieMiij; mi 
times for }\i s with (»()•> inicr^ation ice. Glass beads and cell debns were removed 
by cenirifiigatinn at 13.5IHI x for rit) min. The supernal;* ill suhAequenily 
prcetpitated by the same piocedure :is w.ith the extracellular Inution. lloth 
pr.neui Iractions were re impended in ll. t in) of HE PES huller (20 m.M .V-J- 
hyttnmi.-ih\lpipt:razine-.V , -2-ethaitLSultimtc ucid (HEP'ES: pH H.t)). 5 m.M 
liDTA'ipl I H.IVj. 7 mM |J-mercnptoetbantiK afkr cam ilugaimn at * ,e lor 

It) mm. Sodium ditdcL>l>ulfatt:-puly:ier>lamide eel elect i cphoi e^ts I SDS-PAGE I 
on I V.'J ihi/voI) polyiiLTyhimide svas perlormed l>y ihc niethiwJ nf l.acmnili (21 ) 
Intracellular and extracellular prMcin lr:iciinn«' u l tc bmlcd lnr '» (ton .unl ;ip 
plied if I hi. gel. Proteins were visujbzed bv L'otimitsvk' tuilli.tni lilnv •.i.iimmii: 

1'rnteln diijkeusylalltin. The *upernaiani of a 2lXi-ml culture nl ibe (nomln- 

nam S. n»*/r u j is scpai :i led from the cellular ma>s hycemntuyalion tin *> umt 

at 4.um • c ami iiltr.it um thromth a i),22-u.m-porc->izc membrane, I hc rcu-m- 

hm.inl pldU -ii ol 27 kl);i ( pioU'itO iia* »Cparaied lli»»i ihe Othci pti'lcills 

vcerelcil l»y ,S, t'wnuh- 1 lariiei Hmn *. * lT):i| lis usnij! ; t Dinllo ( 'lltalillcr PXHll 
I'MtKi-iiliatni ( Aiiinoii l)i\iMi'ii ot VS U Ci'raitcc and ( o ) .md ."ul^k.niieiMty 
prcLipnaicil trttm Hie liltrliie uuh 1 i<muncv n| .^vntnc <"i I b Ihc fn*icin 
ftaeiii'ii was collected by Ceninlu^iUfii at l.t.^Hl • e litr '»0 mm .it If and 
ic<us|vitdcd in 1^5 ml ot cudoj:lyct*%ul>>c I iiiciih;Ui'in hinlct I ^H ni .M si 'duo it 
autaic bulki |pll >\. 2o mM 1.1)1 A. I'. 2-mcrcapn vihanoh I he tk.KKifsi 
l.ilniM luivhan m.in liiitiaUd bv ttddme " 2H »') c»do^bio»lda-c I iSicmal ' 
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FIG. 2 Recombinant fl-.xy Ian ase -producing .V eercvisiae strain^ The colonic- 
at the lop of plates A and D are 5 ccnntsiat \2*M[fitrlrLtV2 pDLGl 1 (colonic 
I ami 4). ihnse at die hoitnm left are 5. ceirvuiae ^2't4(/iir7 :LtiL'J pDLC;*: 
(culcnies 2 and 5); and those at the bottom right ar^ S. ccnrnjwe Y2 , i4"( t /nr/' 
LEU2 pDLG6) (colonics } and fi|. Botli plates euruain SC medium and 0.2'. 
RUB-xylun iupplemenlcd with Yr elucose tA) or 2"- galncto>e (B) x, the suit 
carbon source. C'uluniei dcijriiiling RBB-Tylan arc SLiroundcd by pnlc clearitiv 
zonci. The plates were photographed after 30 b of incubniioii ai .tfTC. 



u.1 of denatured protejn (hoileil for 5 min at ID0*C). The reaction nmtuie. \<u* 
incubated at 37*C tor 12 h. alter u-hich the \ample denatured again and n2 
U of frcsb cndoglycoiidasc 5 was added. Alter a further 16 h of incubation, the 
reaction mixture *"as denatured again and SDS-PACL un 1 5*1 (wtAull pub 
acrybmidc was performed 12\), Untreated Xyn2 protein was treated in iher wrii. 
manner, but no endoglyci>sidnsc F was added. 

l>i)mt activity ;i{u\a. Endn-p- L4.«ylanase activiry was assayed by irw 
mctfoil i described by Bailey ct al. (1 ) with \ r i birchwu^xj clucutonaNylan (Sigma i 
as thr iub»irale ut fit) : C. Appropriate dilutions of the cell-free culture solution n> 
50 rr.M. wdturn citrate butler (pH-h.Oj wcie used as ihe enryme source. The 
amount of released sugar uai determined by the ditiitrosalicylic acid method 
described by Miller et al. (261. Thcrmustnbiliiy was icsted hy healing enn-me 
samples for diifereni times at various temperatures, anil the activity was assayed 
at (*>*C for > min as described above. A*wys at different pU values were pci 
Inrmcd as described above, except that Ihe butter; used were 3U mM citrate (pi I 
3.(1). 50 mM citrate phusjihaie (pH 4.0 to 1.0). and 50 m.M phusphaic (pi t tU'i. 
respectively ( II ). 

RESULTS 

Cloning of Ihc 7". reesei XYN2 gene. The T. reesei XYN2 gene- 
was isolated from firsi strand cDNA prepared from reeici 
by using sequence -specific PCR primers. The PCK produci 
was cloned into pDLGl (creating pDLG5 |Fig. IB)) under the 
eumrol of the inducible ADH2 promoter and transformed inio 
5. ccrevishiK Y2VJ,. 0 -. X.y la nnse- producing yeast colonies were 
idem i lied by pin ling on se lee live SC medium containing RBD- 
xyten. The f3-xylanase gene wns isolated from pDLG5 and 
cloned imopJCl (creating pDLG6 [Fig. 1C|) under the control 
of :he constitutive PCK J promoter. This plasmid wns also 
introduced intu S. cerevisiav YIVA. The FORI gene of the 
above-mentioned ycusi siraitis was disrupted to create strain** 
.9. arcvisiac \2 t M{J'wi::LEU2 pDLC5) and S. cervvisiac 
[)'uti\:LEV2 pDLG6). respectively. Both ihese recombinant 
yeasi strains, together with S. ccrcvisiac \29A(furl ::LEU2 
pDLGl) d.s a control, were plnted on SC medium contain- 
ini» 2 r .r utuciise or 2 %. galactose a.s the curhoi) source, respec- 
tively (.Fig. 2A and B). S. ccrcvisitic Y2 l Wi>r/::L£L'3 pDLG5) 
showed smaller clearing zones, because ihe A OH 2 promoter is 
strongly repressed by l r -\. glucose bui less so by. galactose. 

Nucleotide sequence of Ihc A77V7 gene. The nucleotide and 
dec need amino ucid sequences ■of 7'. >roW XV N 2 are presented 
in Fig. .V A 7Sll-bp DNA fragment encoding a 22 3-u mi no -acid 
propeptide has been cloned and expressed in S. ivrcvisiac. The 
I Or 1 - hp in iron ptexcnt in the T. nrM'i genomic ATA'2 gene was 
elirtimued thtough the lirsl-Mrand cDNA Mep. The hydropho- 



Best Available Copy 



■i 
I 



vol. 62. mu 



EXPRESSION OF T. REESE! AT/VJ IN .V. CEHEWSME 



If)? 1 ) 



-10*1 aattcgccaaacc -91 

-90 tgaacaDCCccagcacccgaacagccacacaaccccCccjagcccaaaagjcacaacaactcctacnagccgaagcaagaagacatcaac - 1 

1 ATGCTCTCCTTCACCTCCCTCCTCCCCGGCGTCGCCGCCATCTCCGGCGTCTTGGCCGCTCCCGGCCCCCAGGTCGAACCCCTGGCTGTG 
1 M, V S FT S t» L A G V A A I 5 GV L A A P A A E V ■ E F V A V 

31 GAGAAGCGCCAGACGATTCAGCCCGGCACGGGCTACAACAACGGCTACTTCCACTCGTACTGGAACGATGGCCACGGCGGCGTGACGTAC 
31 E K R OT I 0 P GT GYN II G YF H S Y W N D G H G G V T V 

1 8 1 ACCAATGGTCCCGGCGGGCAGTTCTCCGTCAACTGGTCCAACTCGGGCAACrTTGTCGGCGGCAAGGGATGGCAGCCCGGCACCAAGAAC 
61 TNCPGGQFS V N W S US GN FVGGKGWQPGTKN 

T 

271 AAGGTCATCAACTTCTCGGGCAGCTAGAACCCCAACGGCAACAGGTACCTCTGGGTGTACGGCTGGTCCCGCAACCCGCTCATCGAGTAC 
91 KVINFSG SYNP. NGN .SY. L3VYGWSRNPLIEY 
t 

361 TACATCGTCGGGA>\crrTGGCACCTACAACCCGTCCACGGGCGCCACCAAGCTGGGCGAGGTCACCTCCGACGGCAGCGTCTACGACATT 
121 Y I V G \t T G T Y N P S T G A T K L G E V T S D G S V Y D I 

T 

4 5 1 TACCGCACGGAGCGCGTCAACCAGCCGTCCATCATCGGCACCGCCACCTTTTACCAGTACTGGTCCGTCCGCCGC^ACGACCGCTCGAGC 
lil Y RTQRVNQ P S I r G T A T F Y Q Y W S V R R N H R S S 
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541 G G CT CC GT C AA CAC G GCGAACCACTTCAACGCGTGGGCTCAGC A AGGCCTGACGCTCGGG AC GATGGATTACCAGATTGTTGCCGTGG AG 
161 G S V N TAN H F N AWAQQ GLT LGT M DYQ I V AVE 

631 CGTTACTTTAGCTCTGCCTCTGGTTCCATCACCGTCAGCTAAagggngatct 6B2 
211GY FS 5 GSAS T TVS * 22 3 

FIG. 3. Nucleotide and deduced amino acid sequences uf the XYN2 gene from T. reeiei QM 6a. The hydrophobic leader sequence is underlined, the dibasic reiiduci 
(LyvArgj are double underlined, :ind the PCR primers sequences used are indicatcc in boldface type. Residues marked whh arrows are potential luryel!. fur N 
glycrnylation. The nucleotide sequence duta are available from Gen Bank nucleotide sequence databases (accession number U24191). 



bic leader of I he rccombinnnl propeptide produced by S. cer- 
evisiae is presumably proieolytically processed by the KEX2 
protease, producing a 1 90-a mi no-acid mature proicin (Fig. 3). 

Identification and confirmation of furl disruptions. The FUR] 
gene of S. cerevisiae Y294 strains was disrupted with the furl ;: 
LEU2 allele to generate S. cerevisiae Y294{fur!:±EU2 pDLGl). 
S. cerevisiae Y294(fiirl::LEU2 pDLG5). and 5. cerevisiae Y294 
(furl::LEU2 pDLG6). respectively. The fitr l:\LEU2 disruptive 
allele was isolated as a 3.27-kb NcohNsil linear DNA fragment 
for subsequent transformation of 5. cerevisiae strains to replace 
the chromosomal FUR J gene, .through gene replacement (34) 
with the fitrl::LEU2 allele (Fig. 4A). Leu" yeast Iransformants 
were selected for on SC" Leu medium. To confirm the FUR1 
gene disruption, Nsi\-Nco I -digested genomic DNA from £ 
cerevisiae Y294. S. cerevisiae Y 29 A(fnrl:±EU 2 pDLG5), and 
5. ccremiae Y2M(furI :±EU2 pDLG6) were subjected to South- 
ern hybridization analysis with a 1.33-kb Nsil-Nco\ FURI frag- 
ment from pPE as an ct- 32 P-labelled probe. In S. cerevisiae 
Y294. a DNA fragment of 1.33 kb was radioaciively high- 
lighted, and in S. cerevisiae Y 19 A{fur l:\LEU2 pDLG5) and S, 
cerevisiae Y29A{furi::LEU2 pDLG6), DNA fragments of 3.27 
kb were highlighted, corresponding tu the FUR I and fur/:: 
LEU2 alleles, respectively (Fig. 4B). 5. cerevisiae Y294(furl:: 
LEU2 pDLGl) without the p-!tylanase gene was analyzed in 
the same way (data not shown). 

Northern blot hybridization and SUS-PAGE analysis of 
cloned pruteins. Northern analysis of the mRNA produced by 
the recombinant S. cerevisiae strains showed that high levels of 
XYN2 mRNA were produced by 5. cerevisiae Y294(fur/::LEU2 
pDLGS) when cultivated for 48 h in YPD with 2% galactose as 
the carbon source (Fig. 5 A). Much lower levels of mRNA were 
present in .V cerevisiae Y2W4(///r l:\LEU2 pDLG6) cultivated 
for 4H h in YPD with 2% glucose as the carbon source. On the 
SpS-polyacrylamide gel. a new protein band with an estimated 
molecular mass of 27 "kDa was visible for 5. cerevisiae. Y294 
{Jnri::t.!-:U2 plHXtf) (ihe more active clone), as- well as for S. 
cerevisiae Y29A{furlv.LEU2 pDLGft). The high level of ATA/2 
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FIG. (A) Schematic representation shuwing the diMupiiun t»f ihe FORI 
yene cm ihe genuine uf 0-.\vlanu>r-producirit! X iwivi/ur Mrarns. Thc-cnrymcs 
u>ed werolccl (A). CVhI (CJ. £foRI (Rl). Nco\ {Nc). .Yw'l (Ns). Sat\ fSl. and 
A'aol (X). tB) Suuihern hlut analysis Ah nuing the gemimic.divrupiinn m FVRl 
Uncv I, \ t?trvi\ior WW with the wild- type FUR I itv'ncv J. ,V rcmntur 
Y2^{furt::LF.i!^ pDLG.S): *.V rmuwi- \^i{fitrh:f.F.U2 pDLGfi). genomic 
DNA was hybridized tu'iin a- ,: P-lybeUed U.Vkb -FVRl DNA Ir.iymcni. 
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FIG. 6. Time course or extracellular 0-wlanasc activity produced by 5. cer- 
evhiar V2*i4[furi::LEL;2 pDI.C6) (A). .? sewnta* Y2Wifurl::L£U} pDLG5> 
(U(. iiod T one/ (C) in rich medium. Tlic U-ryUnaw activities were aunyed by 
Ihe method uf Uiitlcy ct jI. (I). descrihed in ihe tevt. YcaM cell count* *crc 
determined Wild the aid uf a hcmuL-ytunicter. 



March 1995). The nucleotide sequence of the cDNA fragment 
containing the T. reesei QM oa XYN2 gene (Fig. 3) is 99^ 
identical lo that reported by Torronen ci al. for the T. reesei 
Rut C-30 XYN2 DNA sequence (GenBunk accession number 
S51973) (44). differing for only six base pair substitutions and 
three base pair insertions. The three C-G insertions are at nu- 
cleotides 29. 55, and 59 of the T. reesei QM 6a XYN2 DNA 
sequence, resulting in amino acids 10 to 19 in the proproiein 
being different from (and more hydrophobic than) those in the 
proprotein encoded by the T. reesei Rut C-30 XYN2 genc. The 
cDNA sequence of the, T. reesei QM 6a XYN2 gene is identical 
to the corresponding nucleotide sequence of T. tvesei VTT-D- 
79125 (GenBank accession number S673K7) reported by 
Saurcluinen el al. (36). except for three base pair substitutions 
that occurred within the coding region of the XYN2 gene. 
These three base pair substitutions also correspond to similar 
base pair substitutions when compared with the T. reesei Rut 
C-30 A'VA/2 nucleotide sequence, suggesting I hut they are mis- 
incorporations by the Taq DNA polymerase during the PCR 
amplification of the T. reesei QM .6u AYN2 cDNA fragment 
(12). The -XYN2 nucleotide sequences of boih T. rccsci OM ha 
and T. nvsei VTT-D-79125 are probably identical if mismeor- 
pora I ion during the PCR amplification is taken into account. 

Buth T. reesei Rut C-30 and T. rvesci VII -0-79125 are mu- 
tants isolated in mutant selection programs at the U.S. Army 
Natick Laboratories. Natick, Maw, (2-U; Rutgers University, 



Rutgers. N.Y. {401; and VTT-D Technical Research Center. 
Espoo. Finland (3). thai started with T. ree.set QM f>a as the 
wild-iype strain (Fig. K). Our sequence dala suggest thai no, 
mutations were introduced into the XYN2 gene of T. reesei 
VTT-D-79125 during the three ^-irradiation steps, four /V- 
nitro-A r -nitro-/v^nitr6soguanidine steps, and one diethyl sul- 
fate mutagenesis step that were performed prior to its isola- 
tion. However, the AY/V2 gene of T. reoW Rut -C-30 most 
probably acquired the three C-G deletion mutations during, 
any of the UV radiation or two /V-nitro-zV-niiro-AZ-nitrosogua- 
nidine mutagenesis steps used in the mutant selection program 
at Rutgers University, resulting in a Ic^s hydrophobic propro- 
lein when compared with T. recsei QM ha and T. reesei VTT- 
D-79125; 

The ATM? gene was inserted between a yeast promoter and 
transcription terminator on multicopy episomal plnsmids to.' 
achieve high levels of gene expression (32). For this purpose, 
we used the 5. cerevisiae ADH2 and TO^/ promoter- termina- 
tor cassettes. Transcription of ADH2 is almost undetectable 
when S. cerevvsiae is grown on fermentable sugars such as 
glucose or galactose but is dercpresscd to a level representing 
about \% of total soluble cellular protein when the yeast is 
grown on nonfermentable carbon sources or fermentable sug- 
ars at concentrations lower than 1% (32). The level of XYN2 
expression under the control of the ADH2 promoter was 
higher when the yeast transformants were grown on galactose 
instead of glucose as the carbon source (Fig. 2). The PGKl 
promoter is a strong constitutive promoter which can be in- 
duced to a level of expression that constitutes 4 to 10% of the 
total soluble protein depending on the growth conditions (15). 
The difference be rwcen the two promoters used can clearly be 
S2cii in Fig. 2. The A DH2 promoter is strongly repressed when 
1 le celts arc plated on glucose, resulting in a large difference in 
nc size of the clearing zones on the plate containing glucose. 

Translation in yeasts can be modulated at the level of initi- 
ation by four aspects of mRNA. structure: (i) the primary 
sequence or context surrounding the AUG codon; (ii) the 
position of the AUG codon, i.e., whether it is first; (iii) scc- 
endary structure both upstream and downstream of the AUG 
codon; and (iv) leader length (20) The leader sequence of the 
T. reesei XYN2 genc is similar lo that of most S. cerevisiae genes 
(20). Therefore, initiation of translation of AY/V2 would be 
effective in 5. cerevisiae. This degree of similarity between 5. 
arcvisuw and T. reesei might be because both these organisms 
belong to the subdivision Ascomycotma. 

High-level secretion of heterologous gene products in 5. 
cerevisiae is mediated by the hydrophobic N- terminal extension 
of the polypeptide, the signal/leader sequence. The leader pep- 
tide is responsible for translocation of the protein to the en- 
doplasmic reticulum, from where the protein is transported 
through specialized secretory organelles, modified, processed, 
and often glycosylated prior to its release by the plasma mem- 
brane into the culture medium. The T. reesei QM 6a AY7V2 
kader peptide has all the characteristics required for process- 
ing in 5. cerevisiae. It is hydrophobic and possesses the dibasic 
residues Lys-Arg (Fig. 3), which are presumably recognized 
and cleaved on the carboxyl side by the KJEX2 protease of A. 
mrevisiae (32). 

However, ihe Xyn2 |3-.\ylanase secre:ed by S: ccrevistueAw a 
different molecular mass from that of the T. reesei enzyme 
(Table 2). The mature [i-xylanase'produccd by 7". tvesei has a 
molecular mass of 21 kDa as deduced from the amino acid 
sequence. The molecular mass determined by SDS-PAGE is 
i.lmtist the same, indicating u virtual absence oi' carbohydrates 
(44). The molecular mass of the. enzyme secreted by A", ir/rW- 
sjuc is 27 kDa as cm i mated by SDS-PAGE. There is- thus a 
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tcimiv of Xyn2. Ttic highcil Activity was measured M pH 6 and M 60'C respectively. The buffer* used m thv 
site phtaphulc (pH 4,0 lo 7.01. and 50 m.M phusphntc (pH 8.U). (C) The icmpemtuic stability or Xyn2 
c of ihc substrate fur 30 min ;u 50. oO. and 70"C (D) The ihcrmouabilii> of Xyn2 ai oO'C determined 
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6-kDa difference in the molecular masses of the p-xylanases 
secreted by S. cercvisiac and T. reesei. This is caused by N- 
glyeosylution of the p-xylanuse secreted by S. cercvisiac (32), 
because 5. cerwisiae lends to "hypcrglycosyiate" heterologous 
proteins. Treatment of the recombinant Xyn2 protein with 
cndoglycosidasc F generated a new protein species with a 
molecular mass of 21 kDa (Fig. 5C). which corresponds tn that 
of native fi-xylnnasc produced by 7'. rccsei. However, this large, 
glycosylated protein (22% sugar content) was efficiently se- 
creted and passed through the yeast cell wall into Ihc culture 
medium without any effect on the growth rate. The pH and 
temperature optima of the recombinant P-xylanase produced 
by 5. cercvisiac compare well with those of the native enzyme 
produced hy T. reesei (Tahle 2). Similar hyperglycosylatiim was 
observed with the ccllobiohydrolascs (CBHI and CBHI1) of T. 
recsei when expressed in S. cercvisiac (2K). 

The XYN2 gene is expressed from an episomal plasmid, and 
the yeast must he kept under selective conditions to ensure 
vector stability. However, (he use of selective synthetic medium 
is not ideal for the production of high levels of heterologous 
proteins. We therefore genetically altered the recombinant 
ycitst strains hi allow authselcetiun for the episomal plasrnids. 
The l-Ulil gene ol'.V. irrvvisiae encodes uracil phosphoribosyl- 
trunsferasc. which catalyzes the conversion of uracil into uri- 



dine Diphosphate in the pyrimidine salvage pathway (19). It 
this gene were disrupted. S. cercvisiac Y294 would not be able 
to utilize urucil from the extracellular medium and would 
therefore not be viable, unless it possessed a complementing 
functional VRA3 gene to synthesize uridine .V-phosphatc dc 
novo. In this case, the UHAJ gene is the yeast selectable: 
marker on the YEp352*based vectors used for the expression 
of XYN2. After the FUR J gene in the [J-xylunuse-producing 5. 
cerwisiae strains was disrupted, these strains could be cultured 
in YPD medium or any other complete synthetic medium 
without the risk of losing the episomal plasm id. S. cerevisiar 
could grow to much higher cell densities when cultivated on 
YPD medium; therefore, p-xylanase activity was considerably 
higher. In S. cercvisiac \29Mfttr!.:LEU2 pDLG5) and 5. aw- 
visiae Y29Mfurl\:LEU2 pDLG6), p-xylanase activity increased 
24- und 3-fold, respectively, when the cells were cultured on 
YPD medium instead of selective SC medium (data not shown), 
lite time needed for p-xylanase production to start also de- 
creased substantially. 

The highest total p-xylunase uctivity obtained in shake flasks 
for the hyperprnducine mutant 7*. recsei VTT-D-86271 (Rut 
C-3H) was 5.40(1 nkitt/ml (2). The total activity obtained for the 
most active rccomhinanl .V cerevishte strain (1.487 nkat/ml) 
compares well with ihal of /'. recsei Rm C-3U if one takes into 
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account that the T. rccsei culture supernatant contains the 
complete battery of enzymes involved in x^Ian degradation. 
These include (J-xyluiridascs that degrade the short \ylo-ojigo- 
saccharides produced by the p-xylunases and debranching en- 
zymes like a -glucuronidases. The cumulalive and synergistic 
action of all these enzymes would therefore, lead to larger 
amounts of reducing sugars being produced compared with a 
culture with only p-xylanase activity. 

Although p-xylana.se production by recombinant organisms, 
such as S. ct'/vvisiae. may not be as high as that in organisms 
such us the uboyc-mcntioncd T. rccsei strain, it might still be 
worthwhile to consider its use for industrial enzyme produc* 
tion. First, S. cerevisiue cannot degrade cellulose, xylan, or any 
of the other polymers in wood; therefore, p-xylanases pro- 
duced by this organism are pure and completely Tree of any 
contaminating celluloses. This is especially important for use in 
the paper industry. Currently, commercial p-xylanase prepara- 
tions marketed for pulp treatment include Pulpzymc HA from 
T. reesei (47). This is a relatively crude (i-xylanase preparation 
with residual ce.llululytic activity and requires careful control uf 
process parameters to avoid damage to fibers. Second, 5. cer- 
evisiue can be cultivated on a variety of relatively inexpensive 
culture media without the need for xylan to induce (J-xylanasc 
production. Furthermore, industrial-scale fermentation tech- 
nology for 5. cerevisiue is well established. 

Future research in our laboratory will be directed toward the 
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development of a more effective expression system for the 
production of fi-xylunascs in S. ccrcyisiac. The addition of a 
p-xylosidase gene and/or gcne(s) encoding debrunching cn* 
zvme(s) will also be considered. 
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Directed evolution to produce an alkalophilic 
variant from a Neocallimastix patriciarum xylanase 

Yew-Loom Chen, Tsung-Yin Tang, and Kuo-Joan Cheng 



Abstract: The catalytic domain of a xylanase from the anaerobic fungus Neocallimastix patriciarum was made more 
alkalophilic through directed evolution using error-prone PCR. Transformants expressing the alkalophilic variant 
xylanases produced larger clear zones when overlaid with high pH, xylan-containing agar. Eight amino acid substitu- 
tions were identified in six selected mutant xylanases. Whereas the wild-type xylanase exhibited no activity at pH 8.5, 
the relative and specific activities of the six mutants were higher at pH 8.5 than at pH 6.0. Seven of the eight amino 
acid substitutions were assembled in one enzyme (xyn-CDBFV) by site-directed mutagenesis. Some or all of the seven 
mutations exerted positive and possibly synergistic effects on the alkalophilicity of the enzyme. The resulting compos- 
ite mutant xylanase retained a greater proportion of its activity than did the wild type at pH above 7.0, maintaining 
25% of its activity at pH 9.0, and its retention of activity at acid pH was no lower than that of the wild type. The 
composite xylanase (xyn-CDBFV) had a relatively high specific activity of 10 128 u,mol glucose-min" 1 (mg protein)" 1 at 
pH 6.0. It was more thermostable at 60°C and alkaline tolerant at pH 10.0 than the wild-type xylanase. These proper- 
ties suggest that the composite mutant xylanase is a promising and suitable candidate for paper pulp bio-bleaching. 

Key words: xylanase, Neocallimastix patriciarum, alkalophilicity, random mutagenesis, directed evolution. 

Resume : Lc domaine catalytique d'une xylanase du champignon anae'robe Neocallimastix patriciarum a ete rendu plus 
alcalophiliquc par evolution dirigfSe en utilisant un PCR a retranscription imparfaite. Les transformants exprimant des 
variantes de xylanases alcalophiliques ont produit des zones claires plus Vendues lorsqu'elles etaient recouvertes avec 
un agar-xylane a pH Heve*. Nous avons identify huit substitutions d'acides amines dans six xylanases mutantes. Alors 
que la xylanase de type sauvage ne manifestait aucune activite" a un pH de 8,5, les activity's relatives et spe'eifiques des 
six mutants e'taient plus eleve'es a pH 8,5 qu'a pH 6,0. Sept des huit substitutions d'acides amines ont ete assemblies 
dans une enzyme (xyn-CDBFV) par mutagenese dirigee. Certaines ou toutes les sept mutations ont eu des effets 
positifs et vraisemblablement synergiques sur I'alcalophilicite' de I'enzyme. Le mutant composite de la xylanase qui en 
est de'coulc' conservait une plus grande proportion de son activite* que I'enzyme de type sauvage a un pH superieur a 
7,0; elle a maintenu 25 % de son activite* a un pH de 9,0, et la conservation de son activite* a des pH acides n'e'tait pas 
inferieure a celle du type sauvage. La xylanase composite (xyn-CDBFV) avait une activity specifique relativernent 
eleve*e de 10 128 umol g)ucosemin~ l (mg proline) -1 a un pH de 6,0. FJIe e*tait plus thermostable a 60°C et plus 
tolerante a Talcalinite a pH 10,0 que la xylanase de type sauvage. Ces proprie'tes indiquent que la xylanase composite 
represente un candidat prometteur adapte au bio-blanchiment de la pate a papier. 

Mots ctts : xylanase, Neocallimastix patriciarum, alcalophilicite\ mutagenese aleatoire, evolution dirigee. 
[Traduit par la Re'daction] 



Introduction 

Endo-P-l,4-xylanase (EC 3.2.1.8) catalyzes the hydrolysis 
of xylan, a plant cell wall heteropolysaccharide that contains a 
backbone of 1,4-linked xylose residues (Kulkarni et al. 1999). 
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Xylanases have been used industrially to bio-bleach paper 
pulp (Buchert et al. 1994; Bajpai 1999), as an animal feed 
supplement (Uhlig 1998), to clarify fruit juice (Biely 1985), 
and to enhance the quality of baked products (Maat et al. 
1992). For environmental reasons, xylanases are desirable in 
that they reduce the amount of chlorine and chlorine dioxide 
used for bleaching paper pulp. During the bleaching of kraft 
paper pulp, the lignin in wood chips is removed by sequential 
treatments with chlorine, chlorine dioxide, and NaOH (Viikari 
et al. 1994). The chlorine and chlorine dioxide create persis- 
tent organic chemicals that are toxic to organisms in the wa- 
terways close to paper plants and may present health risks to 
humans as well. Pretreating paper pulp with xylanases can en- 
hance the efficiency of the chemical extraction of lignin 
(Viikari et al. 1994; Bim and Franco 2000; Georis et al. 2000) 
and so reduce the amounts of chlorine and chlorine dioxide 
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Fig. I. Amino acid sequence of xyn-CDAVT xylanase expressed' 
in pGEX-4Tl vector. Every' fifth amino acid (numbers immedi- 
ately above) is depicted in bold type. The eight amino acids de- 
picted in bold, italicized type (numbers 24, 97, 128, 164. 168, 
208, 214, and 218) are those substituted by others in the 
alkalophilic mutants listed in Table I. 
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required. Because pulp bleaching entails high temperatures 
and high pH, the optimum xylanase for use in this application 
would be thermophilic, thermostable, alkalophilic, and stable 
in alkaline environments (Shoham et al. 1992; Bim and 
Franco 2000). 

The potential usefulness of these enzymes in industry has 
spurred considerable research efforts toward producing more 
thermostable, thermophilic, and alkalophilic xylanases. Two 
approaches have been taken: screening organisms from vari- 
ous sources in nature for high xylanase activity, and engi- 
neering known xylanases to improve their characteristics. 
Possibly, both approaches are necessary to obtain ideal 
xylanases. Thermophilic xylanases have been isolated from 
the hyperthermophilic bacterium Thermotoga mahtima 
(Simpson et al. 1991), from Bacillus spp. (Harris et al. 1997), 
and from the thermophilic fungus Thermomyces kmuginosus 
(Gruber ct al. 1998). These xylanases are insufficiently 
alkalophilic for bio-bleaching, however. Their pH optima are 
between 5.5 and 6.0 compared with pH 9.0, which is the 
ideal (Bim and Franco 2000). 

Random mutagenesis (error-prone PCR) has been used 
successfully to improve specific properties of enzymes, particu- 
larly thermostability. Giver et al. (1998) made a thermostable 
esterase using this technique together with recombination. In 
the same way, Zhao and Arnold (1999) made a thermostable, 
thermophilic thermitase. Error-prone PCR has been used to 
increase the thermostability (and specific activity) of a 
phospholipase Al (Song and Rhee 2000) and a cholesterol 
oxidase from Streptomyces (Nishiya et al. 1997). The 
thermostability of a xylanase was increased through chemi- 
cal (random) mutagenesis (Arase et al. 1993). Research into 
improving alkalophilicity of enzymes has been more limited, 
however. Russell and Fersht (1987) shifted the pH-activity 
profile of subtilisin by site-directed mutagenesis based on 
rational theoretical design. The pH optimum of a t>xylose 
isomerase was shifted downward 0.5 to 0,8 units by similar 
rational design (Cha and Batt 1998). 



A xylanase from the ruminal fungus Neocallimastix 
pairiciarum was characterized (Tamblyn Lee et al. 1993), 
and the gene encoding it was subcloned, sequenced, and 
subjected to structural analysis (Liu et al. 1999). This study 
was conducted to explore the possibility of using random 
mutagenesis and site-directed mutagenesis to shift the pH 
optimum of the N. patriciantm xylanase from acid (pH 6.2) 
to alkaline pH and (or) to increase its specific activity at al- 
kaline pH to render the enzyme better suited to the harsh 
conditions of paper pulp bio-bleaching. 

Materials and methods 

Construction of plasmid pxyn-CDAVT 

The catalytic domain of the xynC xylanase gene from 
N. pairiciarum (Liu et al. 1999) was amplified for expression using 
the plasmid pGEXxynC as a template and the oligonucleotide 
primers 5'-xynl (5'-GAGAGAATTCATGCTTGCTCAAAGTTTC- 
TGT-3') and 3'-xyn2 (5'-CGCGCTCGAGATCACCAATGTAAA- 
CCTT-3'). The PCR product was digested with EcoR\ and Xho\ 
and cloned as a trans lationaJ fusion into EcoR\-, Xho\ -digested 
pGEX-4T-l plasmid (Pharmacia Biotech Inc., Piscataway, N.J.). 
The resultant expression vector was named pxyn-CDAVT. When 
expressed in Escherichia coli the gene product of the catalytic do- 
main of xynC. designated as xyn-CDAVT, became a fusion pTotein 
with glutathione S- transferase. Thrombin cleavage allowed release 
of the recombinant xyn-CDAVT protein, which has eight extra 
amino acid residues (GSPEFMLA) at the N-tcrminal and seven 
residues (LERPHRD) at the C-terminal (Fig. I). 

Error-prone PCR and construction of the libraries of 
variant xylanase genes 

To generate a library of variants of the gene encoding xyn- 
CD/WT we introduced mutations using error-prone PCR. The re- 
action mixture comprised 50-100 ng pxyn-CDAVT, 50 pmol for- 
ward primer (xynl), 50 pmol reverse primer (xyn2). and 2 U of 
Taq DNA polymerase (Promega, Madison, Wis.) in 100 uL of Ix 
reaction buffer. It also contained 1.5 mM MgCk 0.2 mM dCTP, 
0.2 mM dTTP, 0.04 mM dATP, and 0.04 mM dGTK Thermocycler 
settings (Touch Down™ Temperature Cycling System. Hybaid. 
Middlesex, U.K.) were 94°C lor 30 s, 60°C for 30 s, and 72°C for 
30 s (50 cycles). The PCR products (-700 hp) were isolated from a 
1% agarose gel and purified, using a Q1AEX II gel extraction kit 
(Qiagen. Valencia, Calif.), and were then cloned and ligated with 
T4 DNA ligase (Promega) into toRI-, X/ioI-digested pGEX-4T-l 
vector. The resulting plasmids were transferred into E. coli BL21- 
DE3 (Novagen, Madison, Wis.) at 42°C heat shock transformation. 

Screening for alkalophilic xylanase variants 

Transformed cells derived from the library of variants of xyn- 
CDAVT were plated onto Luria-Bertani (LB) agar plates (Miller 
1972) containing ampiciilin (100 ug/mL) and were incubated at 
37 °C for approximately 12 h. Colonies were overlaid with molten 
1% agar (50°C) in 0.1 M disodium tetraborate buffer containing 
0.1% oat spelts xylan (Sigma Chemical Co., St. Louis, Mo.). The 
pH of the overlay buffer was adjusted to 9.0 during the first selec- 
tion cycle and to 10.0 during the second selection cycle. After 
overnight incubation at room temperature, the plates were stained 
with 1% (w/v) Congo red (Sigma) for 1 h and destained with 1 M 
NaCl solution for 2 h (Teather and Wood 1982). The colonies with 
relatively larger halos (i.e.. zones of xylan degradation) were se- 
lected for subculturing; they were the strains containing plasmids 
expressing xylanase variants with increased alkalophilicity. The se- 
lected clones were DNA sequenced through a Taq DyeDeoxy™ 
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mRNA produced by 5. ttreWi/V/e Y2M{fiirl::LEU2 pDLG5) 
probablv leads to higher levels of enzyme compared with those 
in 5. ceivvisiae YZ94(furl::L£U2 pDLCG) (Fig. 5B). Treatment 
of the 27-kDa protein species with endoglycosidase F gener- 
ated a new protein species of 21 kDa that corresponds to the 
molecular mass of native Xyn2 xylanasc isolated from 7*. reesei 
(44) (Fig. 5C). 

P-Xylannse activity. The p-xylanase*producing yeast strains 
and T. reesei QM 6a were analyzed for their ability to secrete 
biologically active p-xylnnases over 125- and 250-h periods, 
respectively (Fig. 6). For optimal production of poiylanasc, S. 
ccrwisiae Y294(fi4rt;:LEU2 pDLG6) was cultured on YPD me- 
dium with 0.8% glucose (15), S. ceremiae Y294(fitrl\:LEU2 
pDLGS) was cultured on YPD medium with 0.8% galactose, 
and T. rvi'se't was cultured on basal medium containing 0.3% 
oat spelts xylan (25). Lower concentrations of glucose and 
galactose were used to reduce the reducing-sugar background 
;it the start of the growth curve, enabling the assessment of 
|i-xylanase activity in the culture Nupurhuiants. The highest 
(3-xylanase activity for S. cerwisiac \294{Jhr/::LEU2 pDLGo) 
and .V. ccrvmiae \294{furl::LEU2 pDLG5) was recorded after 
cu. 70 iind ca. HO h. respectively (Fig. hA and B). S. avuW.vwe 
\ , lW{[url\±tiU2 pDLG6) showed a peak p-xylanase activity 
or I nil nkat/inl. and S. ctrrvisine Y2 i )4ij'ttri::LEU2 pDLG5j 
showed a activity nf 1,2110 nknt/ml. P-Xylanase activity was 
iL-tceted earlier in .V ivrrrixitw Y2M(furl::U-;U2 pl)I.G6), as a 
result of the coiiMituiivc VClKf promoter, it also reached a 



higher cell density in a shorter period than in S. cereviside Y2U4 
ifurl::LEU2 pDLG5). probably because glucose is preferred to 
galactose as a carbon source for 5. cerevisiac growth, in T. 
newt QM 6a, p-xylunase activity was delected after ahout 65 h. 
with the highest levels measured at ca. 450 nkat/ml after about 
180 h (Fig. 6C). 

Effects of pH and temperature on p-xylnnase activity. The 
recombinant p-xylanase activity peaked between pH 4 and 6. 
with the highest activity measured a I pH 6 in 50 mM citrate 
buffer (Fig. 7A). The optimum temperature for this enzyme 
was at 60°C (Fig. 7B). Although the highest p-xylanase activity 
was measured at 60°C. the enzyme is not stable at this tem- 
perature (Fig. 7C). The p-xylunasc activity decreased by more 
than 50% after a 5-min incubation at 60°C; less than 10^c 
activity could be measured after a 10-min incubation (Fig. 7D). 
However, the recombinant enzyme is relatively stable at 5U°C: 
more than 90% activity remained after 3U min at this temper- 
ature (Fig. 7C). 

DISCUSSION 

mRNA wa.N isolated from the xylanolytic fungus T. reesci. 
and the XYN2 gene encoding the main p-xylanase, Xyn2, was 
amplified with the aid of sequence-specific PGR primers. The 
ON A sequence was verified and compared with the DNA 
sequence published by Torriinen el a!. (44) and with. the DNA 
sequences available in the GcnDunk database (release H7.0. 13 
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Tabic 1. Designations, amino acid substitutions, and specific activities of the wild-iype and mutant xylanascs from 
Ncocallimastix parriciarum . 



Xylanase 


Source 


Amino acid 
substitutions 


Specific activity* at 47°C 
pH 6.0 1 pH 8.5' 


tu tiviivity 

retained 


xyn-CD/WT 


Wild-type 


None 


4 343 


0 


u 




xylanase 




(98.5) 


- 




xyn-09D 


First round of 


I97S 


3 460 


47 


1 , JO 




PCR 




(83.5) 


(7.3) 




xyn-C2 


Second round 




2 581 


97 


1 OA 




of PCR 






(45.2) 




xyn-D8 


Second round 


I97S, N164D 


4 no 


252 


U. IJ 




of PCR 




(102.0) 


(15.9) 




xyn-D2 


Second round 


T97S, SI28F, L208F 


2 831 


306 


10.81 




of PCR 




(54,5) 


(35.8) 




xyn-B5 


Second round 


I97S, E24V 


3 710 


404 


10.89 




of PCR 




(164.2) 


(26.4) 




xyn-C3 


Second round 


I97S, V218A 


3 934 


534 


3.57 




of PCR 




(123.4) 


(42.9) 




xyn-CDBFV 


Site-directed 


J97S, K168N, G2I4V, 


10 128 


4542 


44.85 




mutagenesis 


N.164D, S128F, E24V, 


(302.8) 


(315.2) 








V2I8A 









♦Specific activity is expressed as micromoles of glucose released from oat spells xylan per minute per milligram of protein under 
standard assay conditions (10 min; \% (w/v) substrate). Values shown are the means of quadruplicate determinations, -except xyn-C3 
(pH 6.0) and xyn-C2 fpH 8.5), which were determined in triplicate, and xyn-09D (pH 8.5). which was determined in duplicate. 

'Values in parentheses are standard deviations of the means immediately above. 



Terminator Cycle Sequencing kit (Applied Biosys.tems, Poster 
City, Calif.). The gene products (xylanases) from those variants 
possessing amino acid mutation(s) were purified, as described be- 
low, for the pH-activit.y profile assay. 

Two step selection process 

From the first mutational library constructed and screened at 
pH 9.0, one clone (E. coli xyn-09D) was selected based on the 
size of its xy lan-clearing zone (halo). The xylanase from this clone 
was purified, as described below, and its xylanolytic activity was 
assessed at pH from 4.0 to 9.0. The pH activity profile con tinned 
that the alkalophilicity of the enzyme was increased over that of 
the wild type. The xylanase -encoding gene from this mutant was 
then used as template for a second round of error- prone PCR, pro- 
ducing a second mutational library. Screening of this second li- 
brary for xylanase activity at pH 10.0 yielded an additional five 
mutants even more alkalophilic than £. coli xyn-09D 

Site-directed mutagenesis 

DNA sequencing of the six mutant xylanase genes (xyn-09D, 
xyn-C2, xyn-D8, xyn-D2, xyn-B5» and xyn-C3; Table 1) revealed 
amino acid substitutions at eight sites in the genes (Fig. .1). Using 
site-directed mutagenesis, these eight mutations were assembled 
into a series of composite mutant xylanases, which had various 
combinations of the eight mutations. Site-directed mutageneses 
were performed, using the Ex Site™ PCR-based site-directed muta- 
genesis kit (Stratagcne Cloning Systems, Inc., La joila, Calif.). 
Complementary primers for PCR containing the desired DNA 
changes were constructed and included in reaction mixtures in 
which 40 pL distilled H 2 O t 5 pL I Ox pfu-turbo polymerase buffer, 
1 jiL forward primer (125 ng/jjL), 1 pL reverse primer (125 ng/L), 
1 pL dNTPs (10 mM), 1 pL plasmid (10-50 ng), and 1 pL pfu- 
turbo polymerase (2.5 \Jf\\V) were combined in a total reaction vol- 
ume of 50 pL. Thermocyeler settings (TouchDown™, Hybaid) 



were as follows: 95 °C for 4 min. then 15 cycles at 95 °C for 30 s. 
5S°C for 30 s. and 68°C for 13 min. The final primer extension 
was conducted at 70°C for 10 min. Dpn\ ( I pL) was added to the 
reaction, followed by a further incubation at 37°C for 2 h. The re- 
sulting plasmids directly from the final reaction mixture were 
transformed into competent cells of E. coli DH5a. and the resultant 
colonies were DNA sequenced to identify the clones containing the 
desired substitutions. 

Protein purification and pH-activity profiles 

Escherichia coli BL21-DE3 was used for overexpression of the 
xylanases. Cells containing the plasmid of interest were cultured 
overnight in LB broth containing ampicillin (100 pg/mL). then di- 
luted 1:100 in fresh broth to generate 100 mL of culture and incu- 
bated for a further 2 h. The cells were induced by adding 100 pL of 
0.1 M stock solution of isopiopyl-p-D-thiogatactopyranoside 
(0.1 mM) and further incubated at 37°C for 2-3 h. Following in- 
duction, the cultures were centrifuged (7700 x g: 10 min; 4°C), 
and the pellets were suspended in 5 mL of single-strength phos- 
phate buffered saline (Ix PBS) for lysis by sonication (20 pulses of 
10 s). The lysates were centrifuged (12 000 x g\ 10 min; 4°C) and 
5-mL aliquots were combined with 200 U.L of glutathione 
sepharose 4B slurry (Pharmacia) for 2 h to bind the GST-fusion 
proteins. The resultant beads were washed four or five limes with 
PBS, which excluded almost all nonspecific binding as determined 
by SDS-PAGE assay. The target GST- fused xylanases were re- 
leased from the beads by incubation for 7-12 h at room tempera- 
ture with 3.15 pL of PBS buffer, which contained 15 U of thrombin 
protease (Pharmacia). Protein concentrations in the resulting sam- 
ples were determined, using a Bio-Rad protein assay kit (Bio-Rad 
Laboratories, Richmond, Calif.), against a bovine serum albumin 
standard. Activities of the xylanases were determined at pH rang- 
ing from 4.0 to 9.0 under standardized assay conditions using the 
following three buffers: 0.05 M citric acid, for pH 4.0, 4.5, 5.0, 
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Fig. 2* Activity profile of xyn-CD/WT xylanase at pH 4.0-9.0, 
determined in three 0.05 M buffers (citric acid, for pH 4.0-6.0; 
sodium phosphate, for pH 6.0-8.0; and Tris-HCI, for 
pH 8.0-9.0). Activity was expressed as micromoles of glucose 
equivalents released from xylan per milligram protein per 
minute under standard assay conditions. The highest activity, 
observed in pH 6.0 citric acid buffer, was considered as 100% 
activity, and others are plotted here as percentages of that ac- 
tivity. Values shown are the means of quadruplicate determina- 
tions. Bars indicate standard deviations. Where not visible, bars 
fall within symbols. 

!20 -i — 1 




3456789 10 
PH 

5.5, and 6.0; 0.05 M sodium phosphate, for pH 6.0, 6.5, 7.0, 7.5, 
and 8.0; and 0.05 M Tris-HCI for pH 8.0, 8.5, and 9.0. Substrate 
(oat spelts xylan, X-0627, Sigma) was included in each buffer at 
\% (w/v). Incubations (1 00 (XL total volume) were conducted at 
47 °C for 10 min, then were terminated by adding 50 JJL of stop- 
ping solution (a combined aqueous solution of 25% (w/v) K*,C0 3 
and 5% (w/v) Na 2 S 2 0j), followed by 50 \iL of 0.3% (w/v)" 3,6- 
diniirophthalic acid. The resultant mixtures were boiled for 10 min, 
stored on ice for at least 5 min, and their optical densities at 
450 nm were measured with a MultiSkan mierotitre plate reader 
(Labsystem, Helsinki. Finland). The OD 450 values were converted 
to micromoles of glucose equivalents, using a standard curve of 
glucose versus OD 450 . Amounts of xylanase used ranged from 
0.0035 lo 0.008 ng protein in the 100-^L reaction mixture, depend- 
ing on the specific activity of each xylanase. 

Assay of resistance of xyn-CDBFV xylanase to an 
alkaline environment 

The xyn-CDBFV xylanases were incubated at room temperature 
in 0.1 M 2-amino-2-methyl-l-propanol at pH 10.0 (0.0324 ua pro- 
tein in 50 uL buffer) for 0, 2, 4, 6, 12. and 24 h, then 50 uL of 2% 
(w/v) xylan solution was added. Incubation was continued at 47°C 
for 10 min. then xylanase activity (measured as release of glucose 
equivalents) was determined, as described above. Activities in the 
0-h samples were considered as 100%, and activities at the other 
incubation time points were expressed as percentages of this con- 
trol. 

Assay of thermostability and temperature optima of the 
xylanases 

Thermostabilities of xyn-CDAVT and xyn-CDBFV xylanases 
were determined by incubating the enzymes in 50 u.L of 0.1 M cit- 
ric acid buffer (pH 6.0) at 55 or 60°C for 10, 20, 40. 60, 80, 100, 
or 120 min, chilling on ice for at least 20 min. and then determin- 
ing xylanase activity, using the same enzyme concentrations as 



Fig. 3. Activity profile of xyn-CDBFV xylanase at pH 4,0-9.0, 
determined as described for xyn-CD/WT in Fig. 2. The highest 
activity (in citric acid buffer, pH 6.0) was considered as 100% 
activity, and others are plotted as percentages of that activity. 
Values shown are the means of quadruplicate determinations. 
Bars indicate standard deviations. Where not visible, bars fall 
within symbols. 

120 -, 1 
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were used for pH optima. The cooled samples were equilibrated to 
room temperature for 20 min prior to xylanase assays. Incubation 
conditions were those described previously: 10 min at 47°C in 
0,05 M citric acid buffer (pH 6.0) containing 1% (w/v) xylan. The 
temperature optima of the two xylanases were determined by incu- 
bating them in 0.05 M citric acid buffer (pH 6.0) containing \% 
xylan for 10 min at 42, 47, 52. 57, 62, 67, 72, and 77°C. Reactions 
were terminated and activities measured, as described above. 



Results and discussion 

Evolution of a more alkalophilic xylanase, xyn-CDBFV, 
by random mutagenesis and site-directed mutagenesis 

Error-prone PCR conducted on the xyn-CD/WT xylanase 
gene cloned into pGEX-4T-l produced a mutation library 
from which mutant forms of the wild -type /V. patriciarum 
xylanase with enhanced alkalophilicity were effectively 
identified by agar overlay screening. The single clone se- 
lected from the first mutation library. xyn-09D. displayed 
enhanced alkalophiiicitv relative to the parent peptide, xyn- 
CD/WT (Fig. 2 and Table .1). At pH 8.5 and 47°C, the spe- 
cific activity of xyn-09D was 47 fumol glucose-min'^mg 
protein)" 1 , whereas xyn-CDAVT exhibited no activity at that 
pH. Agar overlay screening (at pH 10.0) of a second muta- 
tion library constructed from the xyn-09D gene yielded five 
additional mutants more alkalophilic than xyn-09D (xyn- 
C2, xyn-D8 xyn-D2, xyn-B5, and xyn-C3; Table 1). Site- 
directed mutagenesis was effective for compiling the eight 
amino acid substitutions identified by DNA sequencing of 
the six mutants into a series of composite mutant xylanases. 
One of the composite mutant xylanases, xyn-CDBFV (with 
seven amino acid mutations), proved to be substantially 
more alkalophilic than any of the six mutants produced by 
error-prone PCR (Table 1). 
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Fig. 4, Temperature profiles of xyn-CD/WT (•) and xyn- 
CDBFV (■) xytanases, determined in JO-min incubations at 
pH 6.0 (citric acid buffer). For each xylanase, activities are 
shown as percentages of the maximum activities, recorded at 
47°C for xyn-CD/WT and at 62°C for xyn-CDBFV. Values 
shown are means of triplicate determinations. 
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pH profiles of the eight mutant xylanases 

The eight purified xylanase proteins, xyn-CD/WT, xyn- 
09D, xyn-B5, xyn-C2.xyn-D2, xyn-D8. xyn-C3, and xyn- 
CDBFV (Table 1) were assayed for xylanase activity at pH 
between 4.0 and 9.0. The pH optimum of the wild-type 
xylanase, xyn-CD/WT, was 6.0 (Fig. 2). Jts activity was re- 
duced to only about 10% of the optimum at pH 7.5 and was 
inhibited entirely at pH 8,5 or 9.0. The highest activity of 
composite xylanase, xyn-CDBFV, was also observed at 
pH 6.0 (Fig. 3), but in contrast to xyn-CD/WT, this mutant 
retained about 60% of its highest activity at pH 7.5 and 
maintained about 45% and 25% of its activity at pH 8.5 and 
9.0, respectively. It can therefore be concluded that xyn- 
CDBFV is substantially more alkalophilic than is xyn- 
CD/WT. Specific activities of the six PCR- generated mu- 
tants at pH 8.5 and their relative retentions of activity at 
pH 8.5 versus pH 6.0 were intermediate to the wild type 
(xyn-CD/WT) and the composite mutant (xyn-CDBFV) 
xylanases (Table I). The specific activity of xyn-CDBFV at 
pH 8.5 (4542 pmol glucose-min~ i -(mg protein)" 1 ) was ap- 
proximately 8.5 times higher than that of the best of the six 
PCR mutants, whose specific activities ranged from 47 to 
534 jimol glucosemln~ l (mg protein)" 1 . Substitution I97S, 
which was common to all the mutants, conferred retention of 
a small amount of activity at pH 8.5 (xyn-09D versus xyn- 
CD/WT). Except for mutant xyn-C2, combining substitution 
197 S with one or two others resulted in specific activities at 
pH 8.5 that were 5.4-1.1.4 times higher than that in the sin- 
gle substitution mutant (xyn-09D). Further, compilation of 
seven amino acid substitutions in a composite mutant (xyn- 
CDBFV) increased specific activity at pH 8.5 by over 96 
times that exhibited by the single substitution mutant. It may 
be that some or all of the seven amino acid substituants in 
xyn-CDBFV exert synergistic positive effects on the 
alkalophilicity of the xylanase. One of the substitutions, 
L208F in the xyn-D2 mutant, appeared to have a negative ef- 



Fig. 5. Thermostability of xyn-CD/WT (•) and xyn-CDBFV (■) 
a I 60 EC, determined at pH 6.0 (citric acid buffer). Activities are 
plotted as percentages of the maximal activity, which was ob- 
served in the 10-min incubation, for boih enzymes. Values shown 
are the means of quadruplicate determinations. Bars indicate 
standard deviations. Where not visible, bars fall within symbols. 
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Fig. 6. Alkaline resistance of xyn-CDBFV xylanase at pH 10 in 
0.1 M 2-amino-2-rriethyl-l-propanol (AMP) buffer. The xyn- 
CDBFV protein samples were in pH 1.0, 0.1 M AMP buffer and 
incubated for 0, 2. 4. 6, 12 T 24 h at room temperature. Four 
aliquots of protein samples were assayed for each time course. 
The activity produced by 0 h alkaline treatment samples is re- 
garded as the 100% control. The average specific activity of 
0-h samples was about 67 umol glucose-min~ , *(rng protein)' 1 . 

180 - 
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Incubation time (h) 

feci on alkalophilicity. A composite mutant containing all 
eight substitutions had lower alkalophilicity and lower spe- 
cific activity at pH 8.5 than did composite mutant xyn- 
CDBFV, which did not contain L208F (data not shown). 

Three of the seven amino acid substitutions of xyn- 
CDBFV xylanase (E24V, SI28F and G214V) raised the de- 
gree of hydrophobic ity of the enzyme, whereas two others 
(V218A and I97S) lowered it. It appears that the degree of 
hydrophobicity at some specific sites may determine the de- 
gree of alkalophilicity of the xylanase. The N164D substitu- 
tion increases the negative charge of the xylanase, which 
may also help increase alkalophilicity. Russell and Fersht 
(1987) described how an increase in the negative charge on 
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the surface of the model enzyme subtilisin raised the pK A 
values of acidic groups and how, with the increased pK. d of 
the catalytic acidic amino acid, the enzyme can shift its pH 
optimum to alkaline values. Judging from amino acid se- 
quence alignment from other xylanases (Gruher et al. 1998), 
the two acidic amino acids EM7 and E2I0 (Fig. 1) are the 
catalytic sites of xyn-CD/WT. As well, the K.168N substitu- 
tion can raise the negative charge indirectly by removing the 
positively charged lysine, thus acting cooperatively to raise 
the alkalophilicity of the xylanase together with other substi- 
tutions. Thus, the observations made in the present study are 
consistent with the ideas presented by Russell and Fersht 
(1987). 

Temperature optima of xyn-CD/WT and xyn-CDBFV 

Thermophilicity and thermotolerance of xyn-CDBFV 
were greatly enhanced as compared with its parent enzyme 
xyn-CDAVT. The composite mutant enzyme exhibited maxi- 
mal activity at 62°C compared with the temperature opti- 
mum of 47°C determined for the wild-type xyn-CD/WT 
(Fig. 4). At 60°C the specific activity of xyn-CDBFV 
(13 166u;mol glucose- min -(nig protein)" 1 ) was 130% of its 
activity at 47°C (Table 1; Fig. 4), and at 77°C xyn-CDBFV 
still maintained approximately 10% of its maximal activity. 
By comparison, the maximal activity of xyn-CD/WT re- 
corded at 47°C (4343 jxmol glucose-min~ , -(mg protein)" 1 ) 
was reduced to 2% at only 57°C. 

Assays for thermostability revealed that maximal activi- 
ties of both xylanases were exhibited after a 10-min incuba- 
tion at 60°C (Fig. 5). The activity of xyn-CD/WT was 
reduced to 50% of its maximum after only 60 min at 60° C, 
whereas xyn-CDBFV retained 50% of its activity for 
78 min. Thus, xyn-CDBFV is slightly more thermostable at 
60°C (at pH 6.0) than is its wild-type parent, xyn-CD/WT. 
At 55°C, both xyn-CDBFV and xyn-CD/WT exhibited over 
90% of their maximal activities, even after 120 min of incu- 
bation (data not shown). 

Resistance of xyn-CDBFV to alkaline conditions 

The specific activity of xyn-CDBFV increased to 150% of 
its 0-h activity (67 jamol glucose-min^^mg protein)" 1 ) dur- 
ing the initial 2 h of incubation at pH 10.0 (Fig. 6), and this 
enhanced activity was maintained for at least 24 h. Thus, the 
alkaline stability of xyn-CDBFV at pH 10.0 was clearly 
demonstrated. Comparisons with xyn-CD/WT could not be 
made, however, because the latter enzyme exhibited no 
xylanolytic activity at that pH (data not shown). Conse- 
quently, specific contributions of the seven mutations to the 
property of alkaline stability could not be elucidated. 

In conclusion, the techniques used in this study (error- 
prone PCR to generate mutants, agar overlay to screen for de- 
sired activities, DNA sequencing to identify sites and muta- 
tions, and site-directed mutagenesis to create composite 
mutants) were extremely effective for modifying a thermo- 
philic (half time at 60°C was 60 min) xylanase from 
N. patriciarum for use in paper pulp bio-bleaching. The final 
composite mutant (xyn-CDBFV) maintained a similar propor- 
tion of activity as compared with the parent enzyme at 
pH 4.0-6.5, but the proportion of activity it retained at 
pH 7.5-9.0 was much higher than that of the wild type. The 
enzyme was stable at pH 10.0 for at least 24 h, and its tem- 



perature optimum at pH 6.0 was shifted from 47°C (wild 
type) to 62°C. The specific activity of the composite enzyme 
was about 2.3 times that of the wild type at pi I 6.0. Whereas 
xyn-CDAVT was nonfunctional at pH 9.0, xyn-CDBFV ex- 
hibited specific activity of over 2500uniol glucose*min" , -(mg 
protein)" 1 . At 60°C composite xyn-CDBFV maintained its 
activity longer than did xyn-CDAVT. These properties render 
xyn-CDBFV greatly improved over xyn-CD/WT for use in 
bio-bleaching, where temperatures of 50 or 65°C and alkalin- 
ity of pH 7.0 or 9.0 would be encountered (Shoham et al. 
1992; Bim and Franco 2000; Gcoris et al. 2000). 

The success of this study demonstrates that a high pH 
agar overlay is a powerful screening technique that may be 
applied to other xylanases and other enzymes. Modifications 
of naturally occurring thermophilic enzymes could enhance 
their usefulness in a range of industrial applications not lim- 
ited to the paper pulp industry. Some of the present observa- 
tions suggest that the degree of hydrophobicity at some 
specific sites of the xylanases may determine their alkalo- 
philicity. Further research is necessary to better define the 
relationships between these characteristics, in this and other 
xylanases or other enzymes. 
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The three-dimensional structure of endo-i,4-jS-xy]anase 
II (XYNII) from Trichoderma reesei has been determined 
by X-ray diffraction techniques and refined to a 
conventional tf-factor of 18.3% at 1.8 A resolution. The 
190 amino acid length protein was found to exist as a 
single domain where the main chain folds to form two 
mostly antiparallel /3-sheets, which are packed against 
each other in parallel. The /3-sheet structure is twisted, 
forming a large cleft on one side of the molecule. The 
structure of XYNH resembles that or Bacillus 1,3-1,4-0- 
glucanase. The cleft is an obvious suggestion for an active 
site, which has putative binding sites for at least four 
xylose residues. The catalytic residues are apparentlv the 
two glutamic acid residues (Giu86 and Glul77) in the 
middle of the cleft. One structure was determined at pH 
5.0, corresponding to the pH optimum of XYNII. The 
second structure was determined at pH 6.5, where 
enzyme activity is reduced considerably. A clear 
structural change was observed, especially in the position 
of the side chain of Glul77, The observed conformational 
change is probably important for the mechanism of 
catalysis in XYNII. 

Key words: conformational change/crystal structure/ 
Trichodennaf Kyhnasc 



Introduction 

Xylan is the most abundant hemicellulose in plants, 
accounting for as much as 357c of the dry weight of higher 
plants. Xylans are heteropqlysaccharides, the backbone of 
which is composed of -150-200 j0-l ,4-linked xylo- 
pyranose units. The degree of substitution of xylan varies, 
depending on its botanical origin. The substituents, like 
arabinose. 4-0-Me-D-glucuronic acid and acetic acid, are 
bound covalently to the xylopyranose units. The action of 
several hemicelluiolytic enzymes is needed to hydrolyze 
xylan completely.. During the last few years, the enzymatic 
hydrolysis of xylan has raised significant interest because 
of its applications, for example, in biobleaching and the food 
and feed industry (Biely, I9S5; Poutanen ~et al , 1987; 
Viikarir/ «/.. 1990: Wong and Saddler, 1992; Gilbert and 
Hazlewood. 1993). 

The enzymes acting on the backbone of xylans are 
cxo-1.4-/5-xylosidnses (EC 3.2.1.37) and endo-1 .^-xylan- 
ases (EC 3.2. 1 .8). Endoxylanases are capable of hydrdlyzing 



the internal l,4-£-bonds of the xylan backbone and thereby 
produce several xylo-oligomers of varying length. About 2\) 
amino acid sequences of different low-molecular-weight 
xylanases of bacterial and fungal origin have been published. 
The comparison of available 0-gIycanase sequences by 
hydrophobic cluster analysis (HCA) has revealed eiaht 
families (A-H) (Gilkes etal, .1991; Henrissat, 1991; 
Torronen etal, 1993a). So far, X-ray structures of five 
glycosidic enzymes have been reported: die catalytic domain 
of cellobiohydrolase II from Trichoderma reesei (family B) 
(Rouvinen et al , 1990), the catalytic domain of endoglucan- 
ase E2 from Thertnomonospora fiisca (family B), endo- 
glucanase CelD from Clostridium thermocelhtm (family Ej 
(Juy et al, 1992), endoglucanase V from Humicola insolens 
(Davies etaL, 1993) and Bacillus H(A16-M) hybrid 
1,3-1 ,4-glucanase (Keitel et al. , 1993). 

Low-molecular-weight xylanases belong to the 0-gIycanase 
family G, and their amino acid sequences are highlv 
conserved (Torronen et al, 1993a). The crystallization of 
seven different low-molecular-weight xylanases has beeii 
reported (Moriyama et al, 1987; Rose et al, 1987; Golubev 
et al , 1993; Pickersgill et al , 1993; Torronen et al, 1993b; 
Viswamitra etaL, 1993). However, the only strucrurai 
information published so far concerns the folding pattern of 
IPO endo-1 ,4-/3-xylanase from Bacillus pumiius (Arase 
etal, 1993). 

The filamentous fungus T.reesei is known to be one of 
the most effective producers of polysaccharide hydrolyzing 
enzymes (Kubicek et al, 1993). The two major inducible 
homologous endo-/3-K4-xylanases (EC 3,2. 1 .8) secreted b\ 
T.reesei are XYNI and XYNII (Torronen et al, 1992). They 
are both small protein molecules with molecular masses of 
• 19 and 21 kDa, and consisting of 178 and 190 amino acids, 
respectively. The isoelectric point is 5.2 for XYNI and 9.0 
for XYNII. The pH optimum and product distribution far 
XYNI and XYNII are different. XYNII alone represents 
>50% of the xylanolytic activity of the culture filtrate of 
7~ reesei grown on xylan. We have recendy crystallized bodi 
enzymes (Torronen et al , 1993b) and now report the three- 
dimensional structure of XYNII. 

Results 
Crystallography 

The three-dimensional structure of XYNII has been solved 
using the multiple isomorphous replacement method 
augmented with anomalous scattering information. The final 
model contains two xylanase molecules (called A and B) in 
an asymmetric unit. All the 190 amino acid residues and 
1480 atoms for both enzyme molecules are included in the 
model. The first residue in both molecules is pyrrolidone 
carboxylic acid. The number of water molecules in the final 
model is 344 (0.9/residue). The final model contains a total 
of 3304 atoms. The r.m.s, fit* between the two molecules, 
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Fig. 4. Soheiu-acccs.MMc surOic of the cleft in XYNII structure. 
Two catalytic T-jidiUM and Glul77 are shown. 



Fiji. 1. fAf A ponion of the final mixW with electron density (coefficients 2F obi -F i: . jic . Q . 4k . contoured at la level) around Tvr87. (B) The same 
rcaion of the original <u nave raged} 2.8 A MIR map with the re fined model. 

A and B, is 0.231 A (calculated from the positions of Cor 
a:oms). The /?- factor is 18.3Vc for 24 911 reflections with 
F > la- ;fj between 8.0 and 1.8 A (757c complete). The 
overall Geometry is very eood (0.008 A r.m.s. distances, 
2.5* r.m.s. angles aiv.i0.v : r.m.s. for fixed dihedrals). The 
real-space correlation coefficient fJones eraL. 199 J >. 
calculated for th* final model using all atoms as a function 
of residue, showed good values. The coefficient was smallest 
for residues 164-165 in molecule A (0.64). In molecule 
B. tnese residues had a much stronger electron density. 
Asp! 40 was the only residue which fell outside the allowed 
regions on the Ramachandran map. A portion of the final 
electron density man together with the corresponding MIR 
map is shown n Figure I . 

The second model for XVNTI was calculated usingthe data 
from crystal soaked in a solution containing xylose at pH 
6.5. The final model contains 3304 atoms, the fl-factor is 
lS.4v. for !7 6S4 reflections with jFj > \o \F\ between 
£.0 and I .8 A • 5.3.7 % complete. 84. 1 % complete at 2.5 A 
re.-olu'.ion,!. The average dev iation from the ideal In bond 
lengths is 0.009 A. in bond angles 2.7 3 and in fixed dihedral 
angles 0.9 \ The r.m.s. difference between the two XYNII 
mode!- is 0.148 A and was calculated from the positions 
of all C'.t atoms, in the molecule A. 

Molecular structure 

The schematic drawing and naming of the secondary 
>;ru-:rure elements are presented in Figure 2A. The overall 
shape fit" fni> globular protein is ellipsoidal, having 
approximated dimensions o; 52 >:. 34 y 42 A. XYNII has 
IS^J a:; iim t acid>; it i> a single-domain polypeptide containing 
two :->heet- (A and Bj and one a-helix (Figure 3). The 
m:*u:ilto contains a total of 15 -3-strands. All /3-strands have 
an:iparallel hydrogen bunding, except B6b and B7, which 
h.i\c mixed hcinding. Both o- .sheets A and 8 twist. and form 
a :!e-t .>r. one side nf the protein. The hydrophobic faces 
oi thrxe J-sheeis .ire packed against each other in parallel 
in a sarvdw ich-hke manner, torming the hydrophobic core 
of'.he protein. The hulrophdic face o! ji-shect A forms a 
ibt surface with many .serine and threonine residues, and 
is accessible to solvent, The hydmphilic face of /i-sheet B 
makes a surface for the cleli. 
.'-Sheet A is composed ot the amiparallel tf-strands. The 




V\». 5. T!il- ..i^...,,-. 
lavUKjent- .7 ,' 



I v 'Mi i.. it "I he I'Kvttni! density 
\' .it I i iru-h .m-tiinl Glut 77 

pi I * ; V,i,:\rt IO .III*. 



putative ,) M-.md \! iiiis\iii : .; ui XYNII. but niay exist 
in .some othci liom. :l..gmis \\ kinases. Strand* A2. A3 and 
Ab are shmtei li.uu oi loe i.-mJ-.ic*) lliati the middle strands 
A-l and -\S mine .i;»d 11 ie-.idues|. The .short strands are 
quiii.- pl.in.ii . uheie.r. Hie Ion- strands are twisied. Four 
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Fly *» The schematic .drawine of the secondary' structure elements 
<A) XYNII from Zrcesei and"(B) Bacillus U- l,4-/5-glucanase. The 
arrows mark 0-sirands and cylinder o-hclix. The structure elements of 
l t 3_ i ,4.^-gtucanase arc adopted from the work of Keitel et al. 
(1993).' from which the small helix between strands B4 and A4 is 
omitted for claricv. The corresponding elements between two proteins 
arc numbered in a similar way. The common central folding pattern in 
both proteins is boxed. 

strands are connected at the same end of /3-sheet A to 0- 
sheet B. These loops have different lengths (2-5 residues). 
The connection between A6 and B6 is interrupted by a long 

insertion. , « , m j 

0-Sheet B contains 0-strands Bl -B9. Strands Bl, B2 and 
B9 are short (four, six and four residues), and B3, B4 ( B;> 
and B8 are Ionizer (seven, eight, nine and nine residues). 
One residue divides unit B6 into two shorter strands, B6a 
and B6b. which are twisted -90°. There is also a twist of 
-90° in the middle of the long 0-strands B3, B4 and B5. 
The twisted part of /3-sheet B has been described as a separate 
sheet in xylanase from B.pumilus (Arase et al, 1993). The 
insertion between strands B6 and A6 contains three 0- 
strands: B7, B8 and B9. The only or-helix is located between 
/3-strands A6 and B4, and is packed against the hydrophobic 
face of /3-sheet B. ( m 

The overall structure has the shape of a right nana 
(Figure 1 A and B), where the two /3-shccts (A and B) form 



'fingers' and the twisted part of /3-shcet B and the a-helix 
form a 'palm' (Figure 3A). The long loop of the nine 
residues between /3-strands B7 and B8 makes a 'thumb \ 
which partly closes the cleft. A striking feature is a 12 residue 
long irregular loop between /3-strands B6a and B9. A part 
of this, residues 96-102 (Tyr-Asn-Pro-Ser-Thr- 
Gly-Ala), form a 'cord', which partly closes the cleft on 
one side. The residues in the cord have a clear electron 
density, indicating that this unit has a well-defined structure. 
No hydrogen bonds between the cord and the rest of the 
molecule were observed, but some hydrophobic contacts 
were obvious. 

XYNII contains no cysteine residues, so there are no 
disulfide bridges. However, there are eight negatively 
charged (aspartate and glutamate) residues and 13 positively 
charged (lysine, arginine and histidine) residues in the protein 
molecule. These form ionic bridges: between AspHO and 
Arg81, Glu91 and Argl41/Argl45, Glui07 and Lysl04/ 
Argl42, and Asp 1 16 and Lysl04/Argl42. None of these 
residues is completely conserved. Lys49 has an important 
structural role. It places a positively charged nitrogen in a 
position inside the protein and makes hydrogen bonds with 
the- main chain carbonyl oxygens of AsnlO, G!y30, Gly32 
and Gln34. 

The active site cleft and conformational change 
The shape of the cleft was estimated by using the solvent- 
accessible surface (Voointholt ei al. ,. 1989) (Figure 4). The 
active' site is likely to be located in this cleft, which has a 
lenoth and depth of -25 and 9 A, respectively. The average 
width of the cleft is -4 A , but there are two regions where 
the width is smaller: one between Trpl8 and Pro 126 m the 
middle of the cleft, and the other between Tyr96 and Tyrl/9 
at the end of the cleft. 

XYNn was crystallized at pH 5.0, which also corresponds 
to the pH optimum of the enzyme (Tenkanen et al , 1992; 
Torrdnen et al, 1992). In order to get a complex structure, 
crystals were soaked in a solution containing xylose at pH 
6.5. At this pH, the activity of XYNII is considerably 
reduced and xylose binding was- expected to be better. Clear 
structural changes for some residues in the cleft were 
observed in both molecules A and B (Figure 5). 
Unfortunately, no clear electron density was found for the 
li°and, probably due to the low binding affinity of xylose. 
Later, the effect of pH was studied by collecting a new data 
set at 2.0 A resolution at pH 6.5, but now without xylose 
(data not shown). Again, similar conformational change was 
observed. . 

The largest chanae was observed for Glul77, w" er * 
torsion ansle between Q3 and £7 atoms, was changed - W 
(from -89° to 17 t e ). This places the carboxyl part of the 
side chain in a different position, the displacement being -.a 
A calculated from the positions of the C5 atom. Jnadduioa 
a chanee in the conformation of the side chain of Tyr73 ana 
Tyr88 can be detected. The bond between the atoms Ca and 
Q8 of Tyr73 rotates -78' (^55° to -23 ). and *e 
bond between the atoms QJ and C 7 of Tyr73 -47 Orom 
8^° to 129°). Both rotations cause a change or ~*.u * »j 
the position of the O, atom of Tyr73 The position of £ 
side chain of TyrSS is less affected. The bond between the 
atoms Ca and C& of Tyr88 rotates -9° (from -58 o 
-67*) and the bond between the atoms CIS and I C T of 1> n» 
' - 12° (from 83° to 71°). These rotations lead to an u. 
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Br. 3. (Aj Ribbon representation of the XVN'II molecule showing an a-hclix and /3-sirands. The structure is reminiscent of the shape or a 'right 
ha.-kj' (Bi a^d this analogy is shown. The picture was generated with the MolScript program (Kraulis, 1991). (C) The Ca skcteton of XYN1I in 
si^reo. The puutivc. catalytic residues f'J)uS6 and Glul77) and aromatic binding residues (Tyr96, Tyrl79 and TrplB) are shown. (Dj As in (C), but 
rro:rd along the v-a.\i* ~9Q*. 
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Fig. 6. Hydrogen bond network around Glu86 and Glul77 at (A) pH 5.0. (U) pH 6.5 and (C) hoih together with the main Co chain. 



A change in the position of Ojj of tyr88. In spite of the 
-0.6 A change in the position of the 4 cord\ the differences 
between the two structures elsewhere in the protein molecule 
were small, close to the experimental errors. 



The hydrogen bonds of Glu86 and Glu177 
In previous structural studies it has been suggested that C!u9o 
and Glul82 are the catalytic residues in B.pumilus IPO 
xylanasc (corresponding to Glu86 and Glul77 in XYNH) 
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DCb Ufa 'cord' D» Dl *tfausb* 87 
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C j K RTjCT I a ... V S Kit r AAWES f. CM? LC KM . HETAfii 1 £3y ? SMIt ADVtlS Hi 1 :1 1 C R i ! : 

Ii7 RpfTTi:ttATiTFT:iir,7!AKKSHCKiiu:!:iri<Aif(7 , m».*rr2Y^S£CSSf/"/r.*-< t^'. 

r>TS<rr/6. . .vsAiiFPKWE5LCMFHSRM.YrrAFrjr3Y3SS3«A!/ , AfriJ0t-F[c:) ;:: 

Si r.'/TMKT. ITTCS'JIFOAWARAOC^OFRYYMrKATrCY^iSCSStJir/SC 

iy:i H^scr.-!!. . .TAjiHFiuwAoc^iTLCiWD.Yor/A-.ntrsssfAsir/s ; • 

At Kails B* A* 



Fig. 7. Multiple sequence alignment among endo-M-0-xylanases 
(famiJy G). The totally conserved amino acids are in bold-face type 
and the stars indicate the putative catalytic residues: The secondary 
structure assignment of XYNI1 (7VII) is indicated by arrows (see 
Figure 2). The positions of the 'thumb* and 'cord' are marked. Ak, 
Aspergillus kawachii (Itn et ai, 1992); Trl, Trichodenna reesei XYNI 
(Tbrronen et ai , 1992); Cla, Clostridium acetobutylicwn (Zappe et ai. . 

1990) : Be. Bacillus circutans (Yang et ai, 1 938): Bp, Bacillus pitmilus 
fFukusaki etal, 1984); St. Streptomyces lividans (Shareck et ai, 

1991) ; Tr/l, Trichodenna reesei XYNII (TSrraneri et ai, 1992). 



(Ko et ai , 1992). The conformational change in XYNIT 
observed in this study alters the hydrogen bonding pattern 
of both residues (Figure 6). In the pH 5.0 structure, the Od 
of Glul77 forms a hydrogen bond to the N62 of Asn44, but 
the Oe2 does not form any hydrogen bonds. Thus, it' is 
possible that Oe2 has a proton and Glul77 is neutral in.this 
conformation. In the other (pH 6.5) structure, the position 
of the carboxyl group shifted. There Oei still forms a weak 
hydrogen bond to N52 of Asn44, whereas strong. hydrogen 
bondings to O17 of Tyr88 and to one water molecule were 
observed. GIul 77 is likely to be in a charged form in this 
second observed conformation (pH 6.5), because both its 
Qel and Oe2 accept hydrogen bonds. 

No clear conformational differences were detected in the 
position of Glu86 in these two structures. However, some 
changes were observed in the hydrogen bonding of Glu86. 
In the native structure, Qel of G1U86 makes a strong 
hydrogen bond to Ne2 of Gin 136. Oe2 forms one hydrogen 
bond to Or) of Tyr 77 and one to Or) of Tyr88. The hydrogen 
bond network continues from Or] of Tyr77 to Nel of Trp79 
and to Otj of Tyr 171. In the other (pH 6.5) structure, the 
distance between Oe2 of Giu86 and 0>; of Tyr88 increases 
from 3.3 to 4. 1 A ? because of the* position shift of 0.9 A 
of 0?/ of TyrS8. and the hydrogen bonding disappears. To 
compensate for this loss, the hydrogen bond between Oc2 
of GI1186 and Nt2 of Gin 136 shortens from 3.0 to 2.9 A. 
The link between the putative catalytic residues Glu86 anJ 
Glul77 is a hydroxy! group of Tyr88. The proton attached 
to 0>? of fyr88 points towards Giu86 in *pH 5.0' 
conformation and towards Glu 1 77 in *pH 6.5' conformation. 




Fig. 8. The superimpositi»n.nl" XYNII from T.r?es?i (green) arid 
1.3- l.-i-J-glucanasc from Raa'lhts iytWow). 

Discussion 
Family G xylanases 

Glyeanase family G was first described 'by Henrissat (1991). 
Recently, hydrophobic duster analysis was used to analyze 
the amino acid sequences of 17 different low-molecular- 
weic'ht endo-/3- 1 .4-xylanases (Torronen et erf. . 1993a). The 
amino acid homology in this family is high, indicating a 
similar fold. The alignment of seven amino acid sequences 
in this family is shown in Figure 7. together with the 
keen Jar. .structure assignment of .XYNII. XYNII has a total 
of 32 residues (17^). which are common in all these 
proteins. Both the putative catalytic residues Glu 86 and 
Glu! 77 of XYNII are conserved. There is a clear cluster 
of conserved residues around G!uS6. The direct hydrogen 
bonds to GluH6 (from Gin 136. Tyr77 and Tyr88) are 
conserved, as is a second layer' hydrogen bond from Trp79 
to Tyr.77. The hydrogen bond from Tyrl7l to Tyr77 also 
exists in other xylanases. although the tyrosine residue is 
replaced by histiJine in some members of the family. This 
cluster of five residues around'.Glu.Sft forms a hydrogen bond 
network. v : hicli i> capable of "fine tuning.' the properties of 
Glu 86. 

The >itua;ion is different around GIul 77. where the 
.surrounding is much less conserved. If Glu 1 77 initiates the 
react';- 1:1 h\ donating a hydrogen. I he pK u value of its 
ioni/.: : ve side chain d:rectl> if fleets the catalytic properties 
01 lit. wh»'!e cn/wnc. The pK, value for *lree* glutamic 
a.; id 1^ -5.0. XYNII ha* been reported to I v. most active 
ii V -i high;- pll range: 5.'.i iTcnkaucn et ai, 1992; 
T'o: t t ill.. IW2i. liiothci \v kinases. ilic[)H optimum 
ha - kvn c>tim;'itcd to \aiy Ivivuvn .Vr» atul 7.tl I Wongcv (ti . 
I4 v )jt The variations arc proluhl> ilue to the different amino 
acid ivMduc.N in (he neighhorhood of Glu 177. 

Mai:> ol the eonsened residuum od'eutlo- 1 .4 -p'-xv kinases 
ate lielteved to he .struct 11 rails important in order to confirm 
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Tabic I. Data collection and MIR statistics 



Crystal and soaking data 



Data set 


Soaked compound 


pH 


Soaking 


Soaking 




Unit cell dimensions 












cone. (m.M). 


time (h) 
























Q (X) 


b (AS 


c(M 


0C) 


Native 




5,0 








81.57 


60.63 


38.25 


94.4 


PTl 


K : Pt(CN) 4 


5-0 


5Q 


17 




81.69 


60.82 


38.10 


94.5 


m 


K;Pl(GN) 4 


5.0 


50 


17 




81.51 


60.72 


38.05 


9-M 


HGl 


Hg(OAc), + thioxylose 


5.0 


20 


20 




81.71 


60.96 


.38,40 


94.4 


hg: 


HgfOAc) 3 + thioxylosc 


8.0 


20 


10 




81.35 


60.67 


38.23 


94.4 


HG3 


MeHgOAc + thioxylose 


5.0 


20 


18 




81.47 


60.72 


38. 16 


94.4 


Naiive3 


xylose 


6.5 


10 


22 




81.70 


60.96 


38.05 


94.3 


Diffraction data 


















Data set 


dmin (A) 


No. of 




No. of unique 




Completeness 










measurements 


reflections 




of data (ft) 


(%) 




Native 


1.8 


84 025 




25 322 






75 


4.95 




PTl 


2.0 


77 296 




21 886 






86 


7.31 




pt: 


2.5 


38 961 




12 114 






93 


7.63 




HGI 


2.5 


43 235 




12 336 






94 


4.91 




HG2 


2.5 


29 763 




!1 047 






85 


10.74 




HG3 


2.0 


45 605 




19 484 






77 


8.42 




Native2 


l.S 


44 107 




18 071 






54 


8.77 




MIR analysis 






















Occupancy 


X 


V 




2 




R.m.s.fh/residual /? CuMi , 




PTl -r PT2 


31.01 


0.620 


0.195 




0.102 




1.15 


0.853 






23.64 


0.144 


0.757 




0.421 












10.51 


0.592 


0.185 




0.413 












9.07 


0.0S9 


0.751 




0.109 










HGI 


18.23 


0.197 


0.000 




•0.111 




1.53 


0.723 






20.93 


0.694 


0.942 




0.454 












S.97 


0.588 


0.536 




0.272 












S.05 


0.080 


0;396 




0.262 










HG2 


27.59 


0.198 


0.009 




0.113 




1.77 


0.60S 






30.64 


0,694 


0.948 




0.545 












18.05 


0.589 


0.541 




0.272 












17.55' 


0.078 


0.404 




0.264 










'HG3 


27.11 


0.269 


0.509 




0.442 




0.99 


0.808 






23.72 


0.237 


0.945 




0.022 











Mean fisurc of merit » 0.50 



w hcre ft*,., - 100 x ZL \F-ii)- <F-(\)>\l EE^(i), R.m.s.Wrcsidual « [&F H i)/(ZF 0Bn -F Pii W n - t where F H is the heaw atom form factor, 
F DEZ and F Pl{ arc the derivative structure factor and calculated structure factor. H CMU = ~(I^der *" ^phI)'^(1^dek ^^*nat^* vvhcre *W is the 
native structure factor and the summation is taken over the centric reflections. 



its position. This feature could be useful in catalysis, where, 
the glutamic acid residue has a dual role; First, it will initiate 
the reaction by donating a proton to the substrate; second, 
it may assist (in a charged form) a water molecule to attack 
the substrate. XYNFt has been reported also to have 
iransxylosidase activity (Tenkanen et al. , 1992). The second 
fpH 6.5) conformation is probably involved in that type of 
catalysis. 

Materials and methods 

Crystallization and data collection 

Purification and crystallization ofnativc.cndo-1.4-j3-xylana.se II have been 
described by Tdrronen ei oi (1992, 1993b). The crystals arc monoclinic. 
belonging to the space group P2,. Intensity data wore collected from XVKII 
crystal* on a Rigaku R-AXIS IIC area detector using CuKu radiation 



produced by a Rigaku RU200HB rotating anode (50 kV, 160 mA). The. 
area detector data were processed using R-AXIS IIC software. Data collection 
statistics arc given in Table I. Wc deduced from the unit cell volume that 
there arc two xylanase molecutcs in the asymmetric unit. The self- rotation 
function showed a sharp peak in the portion of * = 90". o = 86° and 
a- = 1 80 °, indicating the presence of a non-crysullographic two- fold clement. 

Structure determination 

In total, 52 different heavy-atom experiments were performed in order to 
find suitable derivatives for MIR phasing. The difference Patterson maps 
were calculated with the XtalView program. Only KiPt(CN)., was found 
to produce a clear substitution for heavy atoms. Later, thioxylnsc wns 
synthesized by replacing a hydroxyl group of CI carbon with a thiol group. 
The XYNII crystals were first soaked in thioxylosc solution and then 
transferred to a mercury aceiale solution. Difference Patterson maps showed 
clear peaks for mercury in the Harkcr section. The co-ordinates of heavy 
atoms were calculated from the Patterson maps and their positions were 
refined with the Protein program (Steigemann. 1992). The electron density 
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the correct folding and packing. In addition, there are three 
residues in which the conservation is interesting. One of 
them, Prp98 in trans-con formation, located in the middle 
of the 4 cord\ seems to be structurally important. It also 
addresses the question of whether or not its conformation 
is capable of changing from trans to cis. If this is possible, 
it could lead to a large structural change in the stucture. 
Naturally, this hypothesis shoujd. be demonstrated experi- 
mentally. Two other interesting conserved residues are the 
Asnl24 and Thrl 33 above the 'thumb' (Figure 3C). The side 
chains of these two residues have no effect on packing. Both 
of them point towards solvent without making any important 
hydrogen bonds to other residues of the protein molecule. 
The role of this conservation is unclear, although we can 
speculate that they take pan in substrate binding. 

The flat Ser/Thr face of /3-sheet A is also quite conserved 
in family G. The function of this face is difficult to 
understand, but it may take part in the penetration of the 
enzyme into small pores, which exist in wood components. 
Its functional role may, to some extent, be similar to separate 
cellulose-binding domains observed in many cellulases. In 
addition, it can be noted that the linker region between the 
cellulase catalytic and substrate-binding domains normally 
contains numerous serines and threonines, although many 
of them are believed to be glycosylated (Gilbert and 
Hazlewood, 1993). 

Comparison with Bacillus 1 ,3 - 7 A-0-glucanase 
The three-dimensional structure of the 0-glucan hydrolyzing 
enzyme, 1,3-1 ,4-0-glucanase (also called lichenase) from 
Bacillus has recently been reported by Keitel et al. (1993). 
The protein is a hybrid containing residues 1-16 of the 
mature Bacillus amyloliquefaciens enzyme and residues 
17—214 from the Bacillus macerans enzyme. The general 
shape of this molecule resembles the XYNU structure, as 
shown in Figure 8. The r.m.s. fit between the Ca atoms 
of 83 residues is .2.2 A, indicating a rather weak structural 
similarity. The reason for the common general shape, but 
weak local structural similarity, was found by comparing 
the topological diagrams of the secondary structure elements 
(Figure 2). The comparison shows that the central folding 
pattern (boxed in Figure 2) is similar, indicating the same 
kind of connection between the central four /3-strands. Both 
structures have an insertion between ^-strands B6 and A6. 
The differences are in the arrangements of the secondary 
structure elements located outside the central region. 

There are three cationic residues, Glu 105, Asp 107 and 
Glu 1 09, in the active site cleft of 1 ,3 - 1 ,4-;3-glucanase, all 
having side chains pointing towards the cleft. The study of 
site-directed mutagenesis has suggested that at least Glu 105 
is important for catalysis. Interestingly, all three residues 
are located in /3-strand B6, which corresponds to the B6b 
strand containing Glu86 in the XYNII structure. In 
comparison, the glucanase structure does not have a 
corresponding residue for the Glu 177 of XYNIL 

Implications for catalysis 

Without high-resolution complex structures, it is difficult to 
determine the ligand binding accurately. The soaking 
experiment with xylose, xylobiose and xylotriose at pH 5.0 
and 6.5 did not reveal the binding mode for the ligands. It 
seems that the binding affinity of xylo-oligomers is too low. 
For example, Ryazanova et al. (1993) have reported that 



D-xylose binds weakly to different Aspergillus japonicus 
xylanases (K t = 0.22-1.5 mM). 

However, by analyzing the conserved residues in the cleft, 
some assumptions can be made. In this respect, the most 
interesting residues are the aromatic residues, which have 
the hydrophobic face of the side chain on the surface of the 
cleft. In many carbohydrate-binding proteins, tyrosine, 
tryptophan or phenylalanine residues participate in the 
binding of a carbohydrate residue by packing against a flat 
face of the carbohydrate ring (Vyas, 1991). In the. cleft of 
XYNII, three residues of this kind exist: Trpl8, Tyr96 and 
Tyrl79 (Figure 3C and D). Trpl8 is located on /3-strand 
B2 at the entrance of the cleft, Tyrl79 on /3-strand A4 in 
the middle of the cleft and Tyr96 is located on the 'cord', 
quite near Tyrl79, but on the other side. By using the 
molecular model of the xylo-oligomer, we can estimate that 
there is space for at least four xylose units in the cleft 
(subsites which we describe as A, B, G and D), It is probable 
that Tyr96 and Tyrl79 are pan of adjacent subsites (A and B) 
for xylose residues, whereas Trpl8 is part of the fourth 
subsite (D) further away. Between residues Tyrl79 and 
Trpl8 there is space for one xylose residue (subsite C). In 
addition, we can note that one face of the phenyl ring of 
Phel80 is on the surface of the cleft, which might interact 
with the substituents of xylan, although this residue is not 
conserved. 

The site of cleavage is probably located between subsites 
B and C or between subsites C and D. Tenkanen*/ al. (1992) 
have studied the distribution of the produced xylo- 
oligosaccharides of different substituted and unsubstituted 
xylans in T,reesei xylanase-catalyzed reactions. The 
hydrolysis of unsubstituted xylan was quite slow, and small 
amounts of xylotriose and longer xylo-oligomers were 
produced. The hydrolysis of substituted xylan was more 
efficient, producing larger amounts.of xylo-oligomers. The 
main product was xylotriose. These data would suggest that 
the site of cleavage is between subsites C and D. 

The reaction mechanism of family G xylanases has been 
studied experimentally by Gebler et al (1992). This study 
has revealed that this family of enzymes are 'retaining'. By 
using the description of Sinnott (1990), the mechanism is 
e — e. This would mean that the leaving group and the 
reaction product are both in the equatorial position. Thus, 
the mechanism would belong to the same class as the widely 
studied and discussed reaction mechanism of the hen egg 
white (HEW) lysozyme. During the last few years, many 
variations for the reaction mechanism of HEW lysozyme 
have been suggested (Kirby, 1987). As in the HEW 
lysozyme, the catalytic residues of XYNII have carboxylic 
acid side chains, which are situated on both sides of the 
substrate. The distance between the carboxyl groups of 
catalytic residues is 7.2 A in the HEW lysozyme. The value 
has been obtained from the lysozyme structure refined in 
our laboratory (unpublished" data). The corresponding 
distances identified in this study are 10.7 A for 'pH 5.0" 
XYNII and 8.1 A for *pH 6.5' XYNII. In XYNII, a new 
and striking feature is the observed conformational change 
in the catalytic residue Glu 177. We could observe this change 
through a rising pH, which led to a loss of an acidic proton 
in Glu 1?7. We suggest that this conformational change also 
occurs in catalysis (at pH 5,0). The conformational change 
is able to alter the pK a value of the side chain of Glu 177, 
making it able to accept or donate a proton, depending on 
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maps were skeletonized with the Bones program and the result was studied 
with the program O (Jones rial, I99I). The first Bones maps were 
calculated using one platinum data set combined from two different crystals 
and two separate data seLs from different crystals soaked in mercury acetate. 
Two maps were calculated, corresponding to both hands.. One map was 
better, showing the overall shapes of four molecules in the unit ceil. However, 
it was riot possible to trace the chain completely from this map. Later, a 
new map was calculated with the data set from the crystal which was soaked 
in thioxylose solution and then in rhcthylmcrcury acetate. This map showed 
improved continuity and we were able to determine the fold of ihc protein 
and assign the sequence. The atomic model for molecule B was built with 
the G program. This molecule was then shifted visually by using.the Bones 
map to the approximate position of molecule A. The position of the model 
was refined by using the real-space refinement feature of the O program, 
and the co-ordinates for both molecules A and B were obtained. Later, the 
electron density maps were calculated to alt heaxy-atom derivative data sets 
using model phases. The electron density maps showed clearly that His 144 
was the principal binding site in all mercury souks. However, no density 
was found for thioxylose. The reason why mercury binding was obtained 
only in the presence of thioxylose remains unknown. 

Structure refinement 

The structure refinement was performed with the X-PLOR program (Bruhger 
a al. 19S7) with the aid of a graphical user interface developed in our 
laboratory. The initial model was refined by simulated annealing. Data in 
the range of 8.0-2.5 A were used in the first refinement. The standard 
slow-cooling protocol was used in the refinement, starting at 3000 K and 
performing 50 steps of molecular dynamics (time step 0.5 fs) and reducing 
the temperature by 25 K until 300 K was reached. In the first cycle, an 
/?-factor was decreased from 59.1 to 37.7% (^-factor = Ed/ 7 ^- 
F K \)IZ{F 0 ). The resulting model was studied with the O program, and 
manual corrections to the structure were made. The refinement cycles, later 
only with energy minimization, and manual rebuilding were repeated until 
the /?-factor decreased to 18.37c. Water molecules were added to the model 
when the R- factor was 24.4ft. 

The second XYNU model was determined by using the data set based 
on xylose soaked crystal (pH 6.5). The final model of the native XYNU 
(pH 5.0) was used as an initial model for the refinement. After some 
corrections in the side chain and water positions, the final model was 
obtained. The pictures in this paper have been created by the O (Jones et at. , 
1991). XtnlView (McRee. 1992) and MolScripi (Kraulis. 1991) programs. 
The co-ordinates will be deposited in the Brookhaven Protein Data Barik. 
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Noted added in proof 

Recently, Campbell and coworkers have solved the structures of t^o 
xylanases from family G. namely xylanases from Trichodenna harzianum 
and Bacillus circulans [Campbell.R.L., Rose.D.R.. Wakarchuk.W.W., 
To.R., Suns.W. and Yaguchi.M. (1993) In Suomincn.P. and Reinikainen.T. 
(cds). Proceedings of the Secotxd TRJCEL Symposiwn oh Tricliodenna reesei 
Celluloses ami Other Hydrolases. Foundations for Biouxhnieaj and Industrial 
Fermentation Research 8, Espoo, pp. 63-72). 
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